INDEX
Negative Logits
russian
-0.09
joindre
-0.08
rubber
-0.08
ผู้
-0.07
Jo
-0.07
chines
-0.07
Roll
-0.07
hil
-0.07
dependent
-0.07
-0.07
POSITIVE LOGITS
boasting
0.10
_UNLOCK
0.09
_PHY
0.08
ਣਾ
0.08
ence
0.08
emocion
0.08
脸
0.08
tenga
0.08
.atomic
0.08
ాలను
0.08
Activations Density 0.001%