NEGATIVE LOGITS
Token            Logit
autorytatywna    -0.78
تقاوى            -0.76
expandindo       -0.76
beginnetje       -0.73
EDEFAULT         -0.71
ⓧ                -0.70
Enllaces         -0.70
:✨               -0.69
ویکیپدی          -0.69
awaiter          -0.69
POSITIVE LOGITS
Token            Logit
ſtand            0.77
deſt             0.72
faſt             0.69
ſelves           0.64
ſet              0.63
fhew             0.63
themſelves       0.63
cauſe            0.62
juſ              0.62
zeba             0.61
Activations Density: 0.178%