INDEX
Negative Logits
-0.10
are
-0.09
,
-0.09
(
-0.08
an
-0.08
(nn
-0.08
.
-0.08
a
-0.07
ваши
-0.07
submitting
-0.07
POSITIVE LOGITS
-to
0.14
to
0.13
to
0.13
_to
0.12
gotta
0.11
To
0.11
—to
0.11
_TO
0.11
TO
0.11
To
0.11
Activations Density 1.784%
are
,
(
an
(nn
.
a
ваши
submitting
-to
to
to
_to
gotta
To
—to
_TO
TO
To