Explanations
proper nouns related to roles, names, and titles
references to leadership or authoritative figures
Negative Logits
PsyNetMessage   -0.91
ickr            -0.79
mell            -0.77
resil           -0.71
Avg             -0.68
vo              -0.65
suppl           -0.65
chloride        -0.63
ktop            -0.62
bottleneck      -0.61
Positive Logits
quartered       1.10
phones          1.09
Head            1.03
quarters        1.03
quarter         1.03
ache            1.02
Head            0.96
lining          0.94
canon           0.94
butt            0.94
Activations Density: 0.005%