INDEX
Explanations
reflexive pronouns and their usage
New Auto-Interp
Negative Logits
themselves
-0.30
itself
-0.29
himself
-0.28
ourselves
-0.24
herself
-0.22
myself
-0.21
oneself
-0.20
zich
-0.20
Himself
-0.20
yourself
-0.19
POSITIVE LOGITS
zelf
0.37
-même
0.30
/us
0.26
/my
0.22
elves
0.20
änd
0.19
aly
0.18
/self
0.18
adows
0.17
SELF
0.17
Activations Density 0.062%