INDEX
    Explanations

    phrases related to possessive pronouns

    New Auto-Interp
    Negative Logits
    ANS
    -0.70
    pmwiki
    -0.65
    WHERE
    -0.64
    IUM
    -0.63
    HQ
    -0.62
    ISM
    -0.60
    MAP
    -0.59
    GW
    -0.58
    MO
    -0.57
    MQ
    -0.57
    POSITIVE LOGITS
     his
    2.54
    his
    2.29
    His
    1.89
     himself
    1.78
     His
    1.64
     HIS
    1.59
    him
    1.49
     him
    1.47
     he
    1.29
     hers
    1.13
    Act Density 0.174%

    No Known Activations