INDEX
Negative Logits
ת
1.05
↵
0.90
u
0.89
pubblica
0.86
ין
0.84
ה
0.84
ان
0.82
an
0.81
a
0.80
,
0.80
POSITIVE LOGITS
be
1.19
an
1.06
ö
0.80
at
0.80
ెస్
0.75
a
0.73
awt
0.73
২০
0.72
\
0.71
ất
0.69
Activations Density 0.002%
ת
↵
u
pubblica
ין
ה
ان
an
a
,
be
an
ö
at
ెస్
a
awt
২০
\
ất