INDEX
    Explanations

    pronouns or noun phrases denoting a group of people

    pronouns indicating individuals or groups

    Negative Logits
    Token             Logit
    " Peak"           -0.72
    "tains"           -0.64
    "Applications"    -0.61
    " immunity"       -0.60
    "pires"           -0.60
    " Outside"        -0.58
    " Gore"           -0.56
    " premature"      -0.56
    " Fail"           -0.56
    " Unlimited"      -0.55
    Positive Logits
    Token             Logit
    "'ll"             1.00
    "'re"             0.96
    "'ve"             0.92
    "ngth"            0.86
    "'d"              0.82
    "bsite"           0.81
    "ald"             0.79
    "ggy"             0.78
    "bart"            0.76
    "'m"              0.75
    Act Density 0.287%
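
Activation density is usually read as the fraction of sampled tokens on which the feature fires at all. The page does not state its exact definition, so the sketch below assumes "activation strictly above a threshold of zero"; the function name `activation_density` and the sample values are hypothetical and only illustrate the arithmetic.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: a token counts as "active" when its activation value
    is greater than the threshold (zero by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1000 tokens, 3 of which activate the feature.
sample = [0.0] * 997 + [1.3, 0.4, 2.1]
density = activation_density(sample)
print(f"Act Density {density:.3%}")
```

Under this reading, a density of 0.287% would correspond to roughly 3 active tokens per 1,000 sampled.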

    No Known Activations