INDEX
    Explanations

    mentions of a specific person referred to as "Her"

    occurrences of the pronoun "Her"

    New Auto-Interp
    Negative Logits
    otation
    -0.64
     orientation
    -0.63
    ynski
    -0.61
    ype
    -0.60
     lockout
    -0.58
    uto
    -0.57
    CVE
    -0.56
    uate
    -0.54
    peak
    -0.54
    amping
    -0.54
    POSITIVE LOGITS
     Her
    3.46
    Her
    2.65
     She
    2.09
     HER
    1.97
    She
    1.62
    her
    1.54
     her
    1.50
     herself
    1.49
     His
    1.41
    she
    1.35
    Act Density 0.009%

    No Known Activations