INDEX
Explanations
references to political dissatisfaction and calls for change
Negative Logits
kháng      -0.15
famously   -0.15
ersen      -0.15
vang       -0.14
bbe        -0.14
ighbour    -0.14
leneck     -0.14
ç¼         -0.14
uropean    -0.14
zast       -0.13
Positive Logits
arkin      0.17
SEP        0.15
WS         0.15
slu        0.15
SEP        0.15
akest      0.15
.nodeType  0.14
583        0.13
layer      0.13
jur        0.13
Activations Density 0.002%