INDEX
Explanations
expressions of political dissatisfaction and calls for social change
New Auto-Interp
Negative Logits
ä»ĬæĹ¥
-0.15
±
-0.15
rlen
-0.14
Monaco
-0.14
erde
-0.14
ستÙĩ
-0.14
ç±į
-0.13
arpa
-0.13
ahn
-0.13
.ext
-0.13
POSITIVE LOGITS
atest
0.15
ourselves
0.14
orientations
0.14
ÙĤاÙħ
0.14
ÙĨظاÙħÛĮ
0.13
monopoly
0.13
mdi
0.13
Forms
0.13
local
0.13
reative
0.13
Activations Density 0.023%