INDEX
Explanations
references to ordinal indicators or positions in a sequence
Negative Logits
egin      -0.15
rine      -0.15
با        -0.14
hữu       -0.14
fisse     -0.14
.setter   -0.13
rint      -0.13
eness     -0.13
вид       -0.13
amage     -0.13
Positive Logits
arily         0.18
reno          0.17
ousand        0.17
/th           0.17
-generation   0.17
ousands       0.17
oley          0.15
icks          0.15
-largest      0.15
ments         0.15
Activations Density: 0.042%