INDEX
Negative Logits
istin
0.38
chảy
0.38
schnelle
0.38
anza
0.37
schneller
0.37
icional
0.36
āta
0.36
schnell
0.36
tief
0.36
ിക
0.35
POSITIVE LOGITS
Another
0.40
काफी
0.38
FORT
0.37
DialogWhenLarge
0.37
گ
0.37
protagonists
0.37
The
0.36
equivoc
0.36
va
0.36
iPhones
0.36
Activations Density 0.001%