INDEX
Explanations
phrases where a female subject is the main focus
instances of the pronoun "she."
New Auto-Interp
Negative Logits
kefeller
-0.84
emetery
-0.71
vernment
-0.70
antage
-0.69
undo
-0.68
hovah
-0.65
ypes
-0.64
Observatory
-0.64
PDATE
-0.63
odder
-0.63
POSITIVE LOGITS
herself
1.54
pher
1.46
athed
1.29
athing
1.23
pard
1.20
ffield
1.10
pherd
1.10
ikh
1.03
lled
1.02
ppard
0.99
Activations Density 0.127%