INDEX
Explanations
mentions of political campaigns
New Auto-Interp
Negative Logits
ansson
-0.17
andom
-0.17
annes
-0.16
.contentMode
-0.16
åĢĻ
-0.15
ception
-0.15
omba
-0.15
utdown
-0.15
zelf
-0.15
istance
-0.15
POSITIVE LOGITS
er
0.24
trail
0.19
agne
0.18
ers
0.18
trail
0.17
aigned
0.17
agna
0.16
REFIX
0.16
Trail
0.15
Trail
0.15
Activations Density 0.017%