INDEX
Negative Logits
dividir
-0.08
_SCOPE
-0.08
-0.07
justification
-0.07
restant
-0.07
(history
-0.07
marine
-0.07
suk
-0.07
inget
-0.07
esclarecer
-0.07
POSITIVE LOGITS
birbir
0.09
cente
0.09
culo
0.08
ely
0.08
ocities
0.08
राजा
0.08
Parks
0.08
Eto
0.08
Recipro
0.08
feathers
0.08
Activations Density 0.057%