INDEX
Explanations
terms related to leadership positions
references to leadership positions or titles
New Auto-Interp
Negative Logits
eros
-0.73
mell
-0.65
ĸļ
-0.64
aneously
-0.63
icity
-0.60
Strat
-0.59
selves
-0.59
PsyNetMessage
-0.59
luckily
-0.58
subp
-0.58
POSITIVE LOGITS
quarter
1.15
lining
1.10
canon
0.98
gear
0.97
scar
0.95
liner
0.93
hun
0.92
phones
0.88
strong
0.87
pin
0.87
Activations Density 0.025%