INDEX
    Explanations

    pronouns referring to collective groups or individuals

    New Auto-Interp
    Negative Logits
     Tanz
    -0.75
     Neurolog
    -0.65
    amiya
    -0.63
    rose
    -0.63
    ilial
    -0.61
     Liang
    -0.60
     Hayden
    -0.59
    asionally
    -0.58
    umerable
    -0.58
     Garner
    -0.58
    POSITIVE LOGITS
     traction
    0.93
     bearings
    0.83
     acquainted
    0.83
     hooked
    0.79
     attention
    0.78
     foothold
    0.78
    started
    0.74
    chy
    0.74
     ready
    0.74
     juices
    0.73
    Act Density 0.115%

    No Known Activations