INDEX
Explanations
pronouns indicating possession or ownership
Negative Logits
ufe          -0.15
uler         -0.15
spar         -0.15
eum          -0.15
ζί           -0.14
incoming     -0.13
instanc      -0.13
ulu          -0.13
urovision    -0.13
dda          -0.13
Positive Logits
own          0.35
own          0.24
Own          0.23
próp         0.21
esy          0.19
propio       0.19
_own         0.19
first        0.18
OWN          0.18
Own          0.18
Activations Density 0.114%