INDEX
Explanations
phrases related to political controversies and actions taken against public figures
New Auto-Interp
Negative Logits
atsu
-0.15
iban
-0.15
nop
-0.14
bah
-0.14
ovi
-0.14
Vie
-0.13
èĦ
-0.13
Æł
-0.13
.namespace
-0.13
quip
-0.13
POSITIVE LOGITS
Suff
0.16
Hospitality
0.15
Zahl
0.14
Germ
0.14
alle
0.14
rc
0.13
trÃŃ
0.13
citiz
0.13
suff
0.13
orna
0.13
Activations Density 0.204%