INDEX
Explanations
mentions of political figures, especially in relation to leadership roles
New Auto-Interp
Negative Logits
odb
-0.17
odic
-0.16
ICON
-0.16
poÄį
-0.15
spect
-0.14
ÙijÙı
-0.14
å¾ħ
-0.14
eyn
-0.14
ово
-0.14
046
-0.14
POSITIVE LOGITS
Ïĩε
0.14
æį
0.14
ÏĨι
0.14
енноÑģÑĤи
0.14
vang
0.14
hti
0.13
acio
0.13
poll
0.13
.Lines
0.13
fullPath
0.13
Activations Density 0.002%