INDEX
Negative Logits
Kürt
-0.07
Hari
-0.06
shocked
-0.06
FN
-0.06
omaly
-0.06
Belfast
-0.06
uma
-0.06
matchups
-0.06
vagina
-0.06
_sms
-0.06
POSITIVE LOGITS
Specifically
0.07
################################################################################
0.06
ém
0.06
rencontrer
0.06
strut
0.06
entially
0.06
ność
0.06
dependence
0.06
Extension
0.06
smarty
0.06
Activations Density 0.007%