INDEX
    Explanations

    recognizable names or phrases related to famous personalities, places, or events

    references to specific dates

    New Auto-Interp
    Negative Logits
     dime
    -0.73
    Reviewer
    -0.72
     thirds
    -0.67
     personally
    -0.58
     independents
    -0.57
    LLOW
    -0.56
     retard
    -0.55
     disabilities
    -0.54
    RO
    -0.53
     tru
    -0.52
    POSITIVE LOGITS
    vier
    1.32
    itor
    1.26
    esville
    1.25
    itors
    1.16
    ice
    1.12
    eway
    1.03
    elle
    0.94
    imus
    0.91
    umping
    0.89
    umper
    0.87
    Act Density 0.022%

    No Known Activations