INDEX
Negative Logits
bæði
-0.09
ity
-0.09
También
-0.08
ndl
-0.08
veedor
-0.08
זו
-0.08
에도
-0.08
izde
-0.08
uillez
-0.08
Dusche
-0.08
POSITIVE LOGITS
a
0.14
an
0.14
some
0.13
frankly
0.12
possibly
0.12
literally
0.12
alot
0.12
the
0.11
awhile
0.10
Some
0.10
Activations Density 0.018%