INDEX
Explanations
verbs indicating movement or transition
New Auto-Interp
Negative Logits
pom
-0.18
QUIT
-0.15
klady
-0.15
weg
-0.14
chance
-0.14
forme
-0.14
pit
-0.14
اÙĦÙĤد
-0.14
quina
-0.14
686
-0.14
POSITIVE LOGITS
alian
0.15
luž
0.15
orgia
0.15
alker
0.14
ноÑģÑĤ
0.14
iam
0.14
rish
0.14
_ISO
0.14
olg
0.13
intox
0.13
Activations Density 0.095%