INDEX
Explanations
pronouns indicating possession or ownership
New Auto-Interp
Negative Logits
uss
-0.19
Ire
-0.16
inkle
-0.16
alth
-0.15
tty
-0.15
182
-0.14
jes
-0.14
ILE
-0.14
kes
-0.14
ibs
-0.14
POSITIVE LOGITS
etical
0.15
past
0.15
éĺ
0.15
ONGL
0.15
اÙĨÙĩ
0.14
orpor
0.14
Past
0.14
stroy
0.13
iger
0.13
gorith
0.13
Activations Density 0.130%