INDEX
Explanations
mathematical symbols and notations in equations
New Auto-Interp
Negative Logits
-0.83
-0.83
of
-0.76
,
-0.76
<eos>
-0.76
and
-0.76
is
-0.75
also
-0.74
in
-0.72
,
-0.69
POSITIVE LOGITS
kaarangay
1.48
Autoritní
1.39
Савезне
1.38
Paglinawan
1.38
Roskov
1.31
__':
1.26
1.26
autorytatywna
1.24
myſelf
1.23
ArrowToggle
1.20
Activations Density 3.525%