INDEX
Negative Logits
trick
-0.08
spæ
-0.07
English
-0.07
músc
-0.07
însă
-0.07
निक
-0.07
leaps
-0.07
upal
-0.07
norms
-0.07
samples
-0.07
POSITIVE LOGITS
selalu
0.09
aina
0.09
lada
0.09
menjaga
0.09
dikkat
0.09
-Encoding
0.08
Selain
0.08
kuhakikisha
0.08
tabbatar
0.08
Besides
0.08
Activations Density 0.004%