INDEX
Explanations
references to protests and related civic unrest
New Auto-Interp
Negative Logits
ÑĢиг
-0.15
ewood
-0.15
Revel
-0.15
мÑĥ
-0.14
roids
-0.14
Podesta
-0.14
rides
-0.14
abyrin
-0.14
leigh
-0.13
Usa
-0.13
POSITIVE LOGITS
agitation
0.27
ag
0.24
demand
0.20
demands
0.19
demanding
0.19
demand
0.19
Demand
0.19
indefinite
0.18
frontal
0.18
stir
0.18
Activations Density 0.058%