INDEX
    Explanations

    possessive pronouns

    possessive pronouns, particularly "his."

    New Auto-Interp
    Negative Logits
    etheless
    -0.64
     both
    -0.63
    IVERS
    -0.59
     alike
    -0.59
     unden
    -0.55
    @#&
    -0.49
    amily
    -0.49
     personalities
    -0.48
    ANGE
    -0.47
    Cry
    -0.47
    POSITIVE LOGITS
    /
    1.60
     or
    1.45
    /#
    1.20
    / 
    1.19
    /,
    1.19
    /.
    1.17
    panic
    1.16
    /"
    1.08
    /)
    1.05
    /(
    1.04
    Act Density 0.216%

    No Known Activations