INDEX
Explanations
pronouns or proper nouns referring to individuals followed by an action or situation
occurrences of the pronoun "he" and related verbs indicating actions or events involving male characters
Negative Logits
Token           Logit
Score           -0.73
utical          -0.71
ILCS            -0.69
lead            -0.66
far             -0.63
cover           -0.62
iliate          -0.59
Exile           -0.58
Editors         -0.57
endorsements    -0.56
Positive Logits
Token           Logit
noticed         1.59
encountered     1.36
spotted         1.34
realized        1.31
heard           1.27
realised        1.26
discovered      1.24
sensed          1.23
stumbled        1.19
overheard       1.18
Activations Density 0.234%
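
As a rough illustration of the density figure above, the sketch below assumes that "activation density" means the fraction of token positions on which the feature fires (activation above a threshold). The page itself does not state that definition, and the function name, the threshold default, and the toy data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumed definition for illustration only; the dashboard's exact
    computation is not given in the source.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: the feature fires on 3 of 1000 token positions.
toy = [0.0] * 997 + [0.8, 1.2, 0.3]
density = activation_density(toy)
print(f"{density:.3%}")
```

Under this reading, a density of 0.234% would correspond to the feature firing on roughly 2 to 3 of every 1000 tokens sampled.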