INDEX
Explanations
words indicating necessity or obligation
New Auto-Interp
Negative Logits
ulent
-0.18
Ul
-0.16
olo
-0.15
zion
-0.15
istas
-0.15
ôi
-0.15
ULO
-0.14
ulle
-0.14
wet
-0.14
pcf
-0.14
POSITIVE LOGITS
antly
0.16
лаÑģ
0.15
пÑĢеж
0.15
ë©´ìłģ
0.15
ensely
0.15
ocha
0.15
lesen
0.15
Daw
0.14
лÑıÑħ
0.14
Adv
0.14
Activations Density 0.099%