INDEX
Explanations
verbs that indicate change or transformation
New Auto-Interp
Negative Logits
iens
-0.14
aran
-0.14
inned
-0.14
ÏĮμε
-0.14
ell
-0.14
IME
-0.14
’n
-0.14
سع
-0.14
uy
-0.14
'n
-0.13
POSITIVE LOGITS
ahkan
0.17
ogle
0.16
odesk
0.15
Gund
0.15
hots
0.15
_ENABLE
0.14
Representative
0.14
Naw
0.14
ahl
0.14
wcs
0.14
Activations Density 0.219%