INDEX
Explanations
possessive pronouns and references to ownership
New Auto-Interp
Negative Logits
oodoo
-0.18
ouz
-0.17
wood
-0.16
ardy
-0.15
ais
-0.15
ngine
-0.15
داÙħ
-0.15
jer
-0.14
cassert
-0.14
assen
-0.14
POSITIVE LOGITS
own
0.32
own
0.25
OWN
0.25
Own
0.25
Own
0.24
èĩªå·±çļĦ
0.20
_own
0.20
próp
0.19
èĩªå·±
0.18
OWN
0.17
Activations Density 0.443%