INDEX
NEGATIVE LOGITS
ır
1.27
{
1.10
AD
1.09
are
1.08
is
1.02
aries
1.01
ae
1.00
我
0.97
هُ
0.97
你
0.96
POSITIVE LOGITS
지
1.76
ן
1.44
s
1.30
ม
1.11
่า
1.09
יי
1.05
ა
1.05
に
1.05
r
1.04
expression
1.02
Activations Density 0.013%