INDEX
Explanations
comparative phrases indicating a measure of quantity or quality
New Auto-Interp
Negative Logits
ãģĹãģĭ
-0.16
rote
-0.15
roid
-0.14
Kron
-0.14
Kurum
-0.14
HEEL
-0.14
.FC
-0.14
fü
-0.14
inds
-0.13
¶
-0.13
POSITIVE LOGITS
than
0.17
_than
0.17
except
0.15
esis
0.15
елов
0.15
Adler
0.14
кÑĢоме
0.14
zza
0.14
except
0.14
obby
0.14
Activations Density 0.058%