INDEX
Explanations
references to political leaders and their actions
New Auto-Interp
Negative Logits
heim
-0.15
AGMA
-0.15
ossier
-0.15
jac
-0.15
nanny
-0.14
:param
-0.14
urm
-0.14
åĸ
-0.13
zem
-0.13
protest
-0.13
POSITIVE LOGITS
himself
0.19
¶Į
0.18
cabinet
0.17
Cabinet
0.17
personally
0.17
uet
0.15
uctor
0.15
ioxide
0.15
ween
0.14
Carrier
0.14
Activations Density 0.338%