INDEX
Explanations
mentions of the pronoun "they" in various contexts
references to subjects referred to as "they."
Negative Logits
Eleven                            -0.83
CCC                               -0.81
cgi                               -0.68
VW                                -0.66
paragraph                         -0.66
////////////////////////////////  -0.65
Cable                             -0.65
Globe                             -0.65
CLASSIFIED                        -0.65
Bull                              -0.64
Positive Logits
're         1.29
've         1.05
'll         1.01
'd          0.99
selves      0.99
themselves  0.86
selves      0.85
ikh         0.84
respective  0.83
self        0.79
Activations Density: 0.257%