INDEX
Negative Logits
(
0.90
يل
0.86
that
0.83
v
0.82
\
0.77
of
0.75
zuf
0.72
be
0.69
л
0.68
about
0.68
POSITIVE LOGITS
在
1.02
ও
0.98
to
0.91
도
0.84
however
0.82
ки
0.82
いえ
0.79
지
0.79
もら
0.76
きた
0.73
Activations Density 0.024%
(
يل
that
v
\
of
zuf
be
л
about
在
ও
to
도
however
ки
いえ
지
もら
きた