INDEX
Negative Logits
essage
-0.07
esk
-0.06
intentionally
-0.06
톡
-0.06
sharedApplication
-0.06
discs
-0.06
Belediye
-0.06
altru
-0.06
Domestic
-0.06
вели
-0.06
POSITIVE LOGITS
xFFFFFF
0.07
ışı
0.06
Fred
0.06
Auckland
0.06
خدام
0.06
foreclosure
0.06
например
0.06
*****↵↵
0.06
""" ↵ ↵
0.06
Ан
0.06
Activations Density 0.001%