INDEX
Negative Logits
utafitiHapana
-0.94
pleaſure
-0.92
myſelf
-0.89
Jefus
-0.88
Monfieur
-0.86
houſe
-0.85
snippetHide
-0.85
سكانية
-0.82
purpoſe
-0.82
ſche
-0.82
POSITIVE LOGITS
minecraft
0.44
-
0.44
formik
0.44
lon
0.43
lay
0.42
rupal
0.42
த்த
0.42
lag
0.41
Tew
0.40
허
0.40
Activations Density 0.001%