INDEX
Explanations
personal pronouns related to self-reflection
references to the pronoun "you."
New Auto-Interp
Negative Logits
¿½
-0.73
assembly
-0.66
Thomson
-0.65
stadt
-0.60
20439
-0.60
EStream
-0.58
pty
-0.57
Commerce
-0.57
Innocent
-0.56
shows
-0.56
POSITIVE LOGITS
're
1.80
've
1.52
'll
1.32
'd
1.10
yourselves
1.07
owe
1.07
know
1.06
are
1.04
yourself
1.02
ngth
1.02
Activations Density 0.247%