Explanations
instances of protests or movements against rights violations
Negative Logits
udder    -0.15
галі     -0.14
有关     -0.14
ly       -0.14
MZ       -0.13
heimer   -0.13
ِك       -0.13
डर       -0.13
ochen    -0.13
Suites   -0.13
Positive Logits
Jeg      0.16
به       0.15
elter    0.15
ـ        0.15
Pey      0.15
-ı       0.14
Persian  0.14
фт       0.14
mehr     0.14
.ir      0.14
Activations Density 0.002%