INDEX
Explanations
possessive pronouns, specifically variations of "my"
New Auto-Interp
Negative Logits
ibu
-0.16
blame
-0.14
mne
-0.14
ÑĤÑĭ
-0.14
pesan
-0.14
.gg
-0.14
istle
-0.14
æ¾
-0.14
fm
-0.13
edi
-0.13
POSITIVE LOGITS
own
0.20
/her
0.16
opia
0.16
_own
0.15
SELF
0.15
own
0.15
rtle
0.14
Own
0.14
ugin
0.14
AMB
0.13
Activations Density 0.094%