INDEX
Negative Logits
later
0.66
mak
0.59
그다음에
0.59
under
0.58
Then
0.58
then
0.57
sometimes
0.57
abor
0.55
بهعنوان
0.55
然后在
0.55
POSITIVE LOGITS
slechts
0.88
ONLY
0.85
endast
0.84
yalnızca
0.82
všetky
0.82
pouze
0.79
تمامی
0.78
only
0.77
somente
0.74
лишь
0.74
Activations Density 0.000%