INDEX
Explanations
various occurrences of the word "head" at differing activation levels
references to leadership positions or titles
New Auto-Interp
Negative Logits
PsyNetMessage
-0.70
Avg
-0.69
resil
-0.67
mell
-0.67
Strat
-0.61
estab
-0.61
Suc
-0.60
ĸļ
-0.60
Constructed
-0.60
Mub
-0.59
POSITIVE LOGITS
canon
1.10
quarter
1.07
scar
0.98
heading
0.98
gear
0.97
phones
0.96
butt
0.96
jack
0.93
lining
0.92
liner
0.91
Activations Density 0.025%