Explanations
instances of first-person singular pronouns
the presence of the first-person pronoun "I."
Negative Logits
Token        Logit
Jarrett      -0.64
Hazard       -0.60
tains        -0.59
lihood       -0.57
Vald         -0.57
INGTON       -0.57
Aston        -0.56
Electrical   -0.56
Philipp      -0.56
Azerbai      -0.56
Positive Logits
Token        Logit
'm           1.45
've          1.43
'll          1.22
'd           1.20
suppose      1.18
guess        1.04
ggy          0.99
owe          0.98
RL           0.96
presume      0.95
Activations Density: 0.356%