INDEX
Explanations
references to personal experiences and storytelling
Possessive pronouns and related words
possessive pronouns
New Auto-Interp
Negative Logits
We
-0.85
we
-0.82
We
-0.74
I
-0.72
we
-0.67
WE
-0.63
незавершена
-0.60
与其
-0.57
WE
-0.56
ihre
-0.55
POSITIVE LOGITS
our
1.81
my
1.72
our
1.34
nosso
1.33
my
1.32
nuestro
1.27
nuestros
1.26
våra
1.25
Our
1.25
meu
1.24
Activations Density 0.385%