INDEX
Negative Logits
a
1.22
to
1.22
z
1.13
a
1.07
the
1.06
in
1.05
i
1.05
txt
1.02
-
0.98
toare
0.97
POSITIVE LOGITS
gauges
1.46
ри
1.38
'
1.25
gages
1.23
ι
1.22
gauge
1.21
gauging
1.16
人
1.15
લ
1.14
ก
1.09
Activations Density 0.001%
a
to
z
a
the
in
i
txt
-
toare
gauges
ри
'
gages
ι
gauge
gauging
人
લ
ก