INDEX
    Explanations

    phrases related to attributing actions or qualities to individuals

    New Auto-Interp
    Negative Logits
    hammer
    -0.66
     Apart
    -0.62
    esp
    -0.61
    catentry
    -0.60
     Voters
    -0.60
    Arc
    -0.60
    ocol
    -0.59
    eal
    -0.59
    tel
    -0.59
    oshi
    -0.56
    POSITIVE LOGITS
     been
    1.35
    been
    1.22
     gotten
    1.12
     Been
    1.02
     undergone
    1.02
     done
    0.93
     gone
    0.92
     fallen
    0.91
     become
    0.90
     begun
    0.87
    Act Density 0.461%

    No Known Activations