INDEX
Explanations
defining categories or states
Negative Logits
សម្រ    0.45
ಮುಂದ    0.43
appunto    0.43
ol    0.43
പ്രത്യേക    0.43
(no token shown)    0.43
గానే    0.42
aparikkh    0.42
is    0.41
तरह    0.41
Positive Logits
،    0.71
,    0.63
(    0.55
5    0.55
、    0.54
(    0.54
6    0.54
9    0.51
=    0.50
(    0.50
Activations Density 0.154%