INDEX
NEGATIVE LOGITS
this          -0.92
København     -0.88
because       -0.87
before        -0.86
in            -0.86
as            -0.85
This          -0.84
during        -0.81
one           -0.80
two           -0.79
POSITIVE LOGITS
styled        0.96
mascota       0.90
incred        0.90
trám          0.88
BarStyle      0.87
离开          0.86
wiele         0.86
errores       0.84
让她          0.83
𓐍             0.82
Activations Density 0.003%