Explanations
expressions of self-identity and emotional states
Negative Logits
Token      Logit
conmigo    -1.16
comigo     -1.02
nous       -0.96
us         -0.86
conosco    -0.82
нам        -0.81
meille     -0.81
nás        -0.80
me         -0.80
لنا        -0.79
Positive Logits
Token          Logit
myself         2.32
myself         1.67
myſelf         1.44
Myself         1.33
خودم           0.97
AnchorStyles   0.97
我自己         0.88
говорю         0.87
personally     0.85
हूं            0.84
Activations Density 0.823%
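
To make the density figure concrete, here is a minimal sketch that assumes "activation density" means the fraction of token positions on which the feature fires (activation above zero). The function name `activation_density`, the zero threshold, and the sample data are all hypothetical and used only for illustration. They are not taken from the page's own tooling.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: a feature counts as "active" on a token when its
    activation is strictly greater than `threshold` (0.0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 823 of them active.
sample = [1.5] * 823 + [0.0] * 99_177
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.823% corresponds to the feature firing on roughly 823 out of every 100,000 tokens.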