INDEX
Explanations
personal pronouns followed by past tense action verbs
pronouns and references to characters in a narrative
Negative Logits
Token           Logit
intervening     -0.70
ythm            -0.70
departures      -0.70
phasis          -0.67
focus           -0.66
Independent     -0.66
jri             -0.66
Compensation    -0.66
contrasts       -0.65
phas            -0.65
Positive Logits
Token           Logit
stole           1.11
collected       0.97
wore            0.97
stash           0.97
purchased       0.96
stored          0.96
deposited       0.94
bought          0.92
confiscated     0.92
retrieved       0.91
Activations Density 0.166%