INDEX
Explanations
references to titles or figures associated with authority and leadership
New Auto-Interp
Negative Logits
bine
-0.18
hread
-0.18
è
-0.17
à¸ģลาà¸ĩ
-0.15
eros
-0.15
ullah
-0.15
eydi
-0.15
å´
-0.15
uropean
-0.15
oose
-0.14
POSITIVE LOGITS
ship
0.36
ships
0.34
kip
0.23
lings
0.22
SHIP
0.21
Protector
0.21
Mayor
0.20
Voldemort
0.20
Lieutenant
0.20
ly
0.19
Activations Density 0.015%