INDEX
Negative Logits
↵↵
1.27
↵
1.16
.
1.06
ת
0.91
_
0.90
ী
0.89
ak
0.83
x
0.82
3
0.81
)
0.80
POSITIVE LOGITS
was
0.88
to
0.88
तरह
0.79
ുമോ
0.76
inama
0.75
리그
0.75
were
0.73
سون
0.73
ᅨ
0.73
histo
0.72
Activations Density 0.021%
↵↵
↵
.
ת
_
ী
ak
x
3
)
was
to
तरह
ുമോ
inama
리그
were
سون
ᅨ
histo