INDEX
Explanations
phrases related to protests and public dissent
Negative Logits
OA           -0.15
IMER         -0.15
ause         -0.15
.compat      -0.14
OLE          -0.14
обор         -0.14
overcoming   -0.14
sounds       -0.14
레이         -0.14
.examples    -0.14
Positive Logits
perceived    0.32
alleged      0.20
allegedly    0.20
lack         0.19
perceive     0.17
handling     0.17
perce        0.17
lack         0.16
treatment    0.16
abbit        0.15
Activations Density 0.235%