INDEX
Negative Logits
ăpadă        0.77
eril         0.76
kendini      0.75
cytology     0.74
H            0.74
apadani      0.73
variación    0.73
pozorn       0.72
にっいて     0.71
penampilan   0.71
Positive Logits
oh           0.98
Oh           0.94
gosh         0.88
jaw          0.81
ओह           0.80
hey          0.79
डू           0.78
glad         0.76
बदला         0.74
doom         0.72
Activations Density 0.000%