INDEX
    Explanations

    phrases where a female subject is the main focus

    instances of the pronoun "she."

    New Auto-Interp
    Negative Logits
    kefeller
    -0.84
    emetery
    -0.71
    vernment
    -0.70
    antage
    -0.69
    undo
    -0.68
    hovah
    -0.65
    ypes
    -0.64
     Observatory
    -0.64
    PDATE
    -0.63
    odder
    -0.63
    POSITIVE LOGITS
     herself
    1.54
    pher
    1.46
    athed
    1.29
    athing
    1.23
    pard
    1.20
    ffield
    1.10
    pherd
    1.10
    ikh
    1.03
    lled
    1.02
    ppard
    0.99
    Act Density 0.127%

    No Known Activations