INDEX
Explanations
comparisons and superlatives referencing quantity or characteristics
New Auto-Interp
Negative Logits
reation
-0.16
iliki
-0.15
hạng
-0.15
rena
-0.15
oton
-0.14
_lineno
-0.14
relative
-0.14
ali
-0.14
isu
-0.14
relative
-0.13
POSITIVE LOGITS
others
0.18
others
0.17
AUSE
0.16
other
0.16
åħ¶ä»ĸ
0.16
ught
0.16
LOPT
0.15
iov
0.15
aeper
0.14
ouri
0.14
Activations Density 0.028%