INDEX
Explanations
terms related to protests and social conflicts
NEGATIVE LOGITS
Token       Logit
WAYS        -0.16
vant        -0.15
zí          -0.15
ien         -0.15
Incontri    -0.14
hôn         -0.14
ük          -0.13
atif        -0.13
Blasio      -0.13
λι          -0.13
POSITIVE LOGITS
Token       Logit
calls       0.32
widespread  0.27
growing     0.26
intense     0.24
calls       0.23
criticism   0.23
Calls       0.23
renewed     0.22
fresh       0.22
Calls       0.22
Activations Density 0.100%
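
The density figure above can be read as the share of tokens on which the feature fires. The sketch below assumes that reading (a token counts as active when its activation is above zero). The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means active tokens / total tokens.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 1 token out of 1000.
sample = [0.0] * 999 + [2.5]
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.100%"
```

Under this reading, a density of 0.100% corresponds to roughly one active token per thousand.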