INDEX
Explanations
references to political figures and their actions in the context of elections
Negative Logits
ras           -0.17
isco          -0.17
ITER          -0.16
imb           -0.15
enter         -0.14
ì¤Ģ           -0.14
.prot         -0.14
instrument    -0.14
ob            -0.14
Ver           -0.13
Positive Logits
tsky          0.19
DSA           0.18
iverz         0.16
demands       0.16
zia           0.15
quelle        0.15
gressive      0.15
DNC           0.15
franca        0.14
deg           0.14
Activations Density: 0.177%