Explanations
phrases related to a person's life events and achievements
pronouns referring to a specific subject, primarily the male pronoun "He" and its variations
Negative Logits
Token               Logit
SI                  -0.76
intent              -0.68
=-=-=-=-=-=-=-=-    -0.66
warning             -0.65
Saying              -0.63
pointing            -0.63
Warning             -0.63
bothering           -0.62
Pause               -0.62
Sorry               -0.60
Positive Logits
Token               Logit
remained            1.22
became              1.21
subsequently        1.19
graduated           1.19
married             1.19
eventually          1.18
continued           1.18
later               1.18
survived            1.18
participated        1.17
Activations Density 0.181%
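
The density figure above is commonly read as the fraction of token positions on which the feature fires. The sketch below illustrates that reading under assumptions that do not come from this page: the function name `activation_density`, the zero threshold, and the sample data are all hypothetical, and the dashboard's actual computation may differ.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means (number of active positions) / (total positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1000 token positions, 2 of which activate the feature.
sample = [0.0] * 998 + [3.2, 0.7]
print(f"{activation_density(sample):.3%}")
```

Under this reading, a density of 0.181% would mean the feature is active on roughly 2 of every 1,000 tokens.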