INDEX
Negative Logits
are
0.75
as
0.74
4
0.73
is
0.70
be
0.67
3
0.65
of
0.65
),
0.63
5
0.62
።
0.62
POSITIVE LOGITS
و
0.72
おそらく
0.61
น
0.60
और
0.58
同じ
0.58
기와
0.57
스와
0.57
その
0.55
스가
0.54
이니
0.54
Activations Density 0.345%
are
as
4
is
be
3
of
),
5
።
و
おそらく
น
और
同じ
기와
스와
その
스가
이니