INDEX
Explanations
modal verbs indicating ability or possibility
New Auto-Interp
Negative Logits
Noir
-0.15
oeff
-0.14
tit
-0.14
наÑĩе
-0.14
ocado
-0.14
ocs
-0.14
odi
-0.14
odic
-0.14
odie
-0.13
aka
-0.13
POSITIVE LOGITS
WithOptions
0.17
deaux
0.15
ifo
0.15
Voll
0.15
chter
0.15
lom
0.14
ύ
0.14
WithType
0.14
ameron
0.14
íĥĿ
0.14
Activations Density 0.032%