Explanations
assertive actions or directives related to communication and scheduling
Negative Logits
Token       Logit
Yourself    -0.20
yourself    -0.19
youre       -0.19
，你        -0.17
THEY        -0.16
Votre       -0.16
your        -0.16
Your        -0.16
/us         -0.15
tus         -0.15
Positive Logits
Token       Logit
him         0.33
them        0.27
me          0.26
thee        0.24
y           0.23
ya          0.22
ihn         0.20
us          0.20
lui         0.20
him         0.20
Activations Density 0.153%