INDEX
Negative Logits
ﻨ    0.68
ﺩ    0.66
később    0.65
později    0.63
o    0.60
ad    0.59
ﺱ    0.59
sin    0.58
the    0.57
ography    0.57
Positive Logits
can    0.91
ח    0.89
ع    0.81
’    0.79
ال    0.78
א    0.78
is    0.77
માં    0.77
ح    0.77
,    0.76
Activations Density 0.020%