INDEX
Explanations
instances where the protagonist takes action or demonstrates agency
instances of the pronoun "I."
New Auto-Interp
Negative Logits
pires
-0.66
tains
-0.63
ãĥĬ
-0.62
INGTON
-0.60
groupon
-0.58
Sterling
-0.58
Emin
-0.58
electromagnetic
-0.58
iston
-0.56
Philipp
-0.56
POSITIVE LOGITS
'm
1.32
've
1.26
suppose
1.16
guess
1.05
'll
1.04
'd
1.03
awoke
0.98
ggy
0.97
presume
0.97
ulia
0.92
Activations Density 0.286%