INDEX
Explanations
verbs indicating possibility or capability
New Auto-Interp
Negative Logits
Ú©ÙĨ
-0.16
aben
-0.16
ieber
-0.15
al
-0.15
ake
-0.15
HM
-0.14
889
-0.14
fail
-0.14
hm
-0.14
nod
-0.14
POSITIVE LOGITS
Raphael
0.15
ucas
0.15
uç
0.15
ANGLES
0.15
.nt
0.15
Ñĥж
0.14
ibble
0.14
worse
0.14
eds
0.14
enarios
0.14
Activations Density 0.054%