INDEX
Explanations
titles and positions related to leadership roles
New Auto-Interp
Negative Logits
tingham
-0.17
loub
-0.15
ÙĬÙ쨩
-0.15
anism
-0.15
éłŃ
-0.14
ypse
-0.14
ivor
-0.14
attery
-0.14
berman
-0.14
canf
-0.14
POSITIVE LOGITS
lining
0.27
hon
0.25
master
0.24
liner
0.23
hunt
0.22
quartered
0.21
strong
0.21
shot
0.20
lined
0.20
lin
0.20
Activations Density 0.010%