INDEX
Negative Logits
is
1.09
1.02
was
0.98
of
0.97
to
0.97
were
0.97
had
0.91
has
0.87
I
0.86
for
0.85
POSITIVE LOGITS
lined
0.86
sthe
0.85
ა
0.82
ка
0.82
لون
0.82
that
0.80
່ວນ
0.76
посмотреть
0.76
ζει
0.75
を行います
0.75
Activations Density 0.002%
is
was
of
to
were
had
has
I
for
lined
sthe
ა
ка
لون
that
່ວນ
посмотреть
ζει
を行います