INDEX
Negative Logits
žo
-0.91
multer
-0.90
piratas
-0.84
verden
-0.84
cytok
-0.83
Arma
-0.82
exclusivos
-0.82
quiao
-0.81
colm
-0.81
slant
-0.81
POSITIVE LOGITS
ref
0.77
とし
0.75
Ausgaben
0.73
ᑭ
0.72
sincerely
0.69
enough
0.68
accomplish
0.67
pite
0.66
Niedersch
0.66
while
0.66
Activations Density 0.000%