INDEX
Negative Logits
dư
-0.07
zung
-0.07
=Y
-0.06
함
-0.06
+self
-0.06
herself
-0.06
getTable
-0.06
landı
-0.06
joy
-0.06
Would
-0.06
POSITIVE LOGITS
unregister
0.07
_Al
0.06
SETTINGS
0.06
OTO
0.06
onlara
0.06
ynomial
0.06
meine
0.06
uid
0.06
_protocol
0.06
489
0.06
Activations Density 0.000%