INDEX
Explanations
comparative phrases and expressions of quantity
New Auto-Interp
Negative Logits
\<^
-0.16
edBy
-0.15
ioxide
-0.15
wy
-0.15
geb
-0.15
.way
-0.14
onz
-0.14
scri
-0.14
ologically
-0.13
cairo
-0.13
POSITIVE LOGITS
many
0.17
veral
0.15
ivity
0.15
fewer
0.14
zell
0.14
Ľå»º
0.14
еÑĤÑĮ
0.14
ous
0.14
insk
0.14
dozens
0.14
Activations Density 0.189%