INDEX
Explanations
first person pronouns and related words
personal pronouns and first-person references
Negative Logits
myself            -0.92
myself            -0.91
getMy             -0.83
meinem            -0.78
meines            -0.77
ผม                -0.76
myſelf            -0.75
expandindo        -0.75
adaptiveStyles    -0.74
my                -0.73
Positive Logits
,                 0.49
.                 0.46
ết                0.46
—                 0.39
and               0.36
?                 0.35
to                0.33
!                 0.33
Wiktionnaire      0.32
in                0.32
Activations Density: 2.945%