INDEX
Negative Logits
as
1.62
of
1.40
л
1.34
in
1.26
ب
1.22
’”
1.17
ב
1.16
of
1.15
at
1.14
hypersurfaces
1.13
POSITIVE LOGITS
↵
1.23
ิ
1.17
ри
1.11
িম
1.09
EM
1.03
ﺭ
1.02
{1.01
prende
0.93
ן
0.92
0.92
Activations Density 0.000%
as
of
л
in
ب
’”
ב
of
at
hypersurfaces
↵
ิ
ри
িম
EM
ﺭ
{prende
ן