INDEX
Negative Logits
C
1.32
F
1.30
et
1.09
ak
1.08
aş
1.08
B
1.06
শুধ
1.05
Однако
1.05
T
1.03
אם
1.02
POSITIVE LOGITS
with
1.62
in
1.48
the
1.46
all
1.35
a
1.30
on
1.27
which
1.25
as
1.22
from
1.22
to
1.20
Activations Density 0.120%
C
F
et
ak
aş
B
শুধ
Однако
T
אם
with
in
the
all
a
on
which
as
from
to