INDEX
Explanations
terms related to political discourse and themes
New Auto-Interp
Negative Logits
elden
-0.18
hazi
-0.18
Ñľ
-0.17
ey
-0.16
æľŁ
-0.15
ederland
-0.15
ãĥ©ãĥ¼
-0.14
缮
-0.14
sid
-0.13
avor
-0.13
POSITIVE LOGITS
lob
0.15
nÄħ
0.15
hower
0.15
exped
0.14
genic
0.14
-economic
0.13
vented
0.13
ToLocal
0.13
oidal
0.13
uste
0.13
Activations Density 0.033%