INDEX
Explanations
references to victimization and oppression
New Auto-Interp
Negative Logits
YLE
-0.16
o
-0.15
Imper
-0.14
SCR
-0.14
Sik
-0.14
Sick
-0.14
sey
-0.14
agendas
-0.14
alf
-0.13
luk
-0.13
POSITIVE LOGITS
éģĩ
0.15
ijľ
0.15
edException
0.15
ooter
0.15
kate
0.14
dealloc
0.14
USTER
0.14
udi
0.14
ovny
0.14
orte
0.13
Activations Density 0.436%