INDEX
Negative Logits
or
1.50
to
1.32
and
1.29
in
1.18
E
1.10
K
1.09
𝐓
1.06
as
1.05
R
1.02
J
1.00
POSITIVE LOGITS
bitterly
1.38
.
1.35
요
1.31
ྔ
1.26
m
1.24
↵
1.22
৪
1.22
mL
1.17
뭔
1.16
b
1.16
Activations Density 0.000%
or
to
and
in
E
K
𝐓
as
R
J
bitterly
.
요
ྔ
m
↵
৪
mL
뭔
b