INDEX
    Explanations

    mentions of an action related to removing or kicking something or someone out

    occurrences of the word "Out" and its variations

    New Auto-Interp
    Negative Logits
     arsen
    -0.87
    avorite
    -0.73
    interstitial
    -0.69
    OTT
    -0.65
    ione
    -0.63
    vre
    -0.63
    =-=-=-=-=-=-=-=-
    -0.61
     turnover
    -0.58
     misunder
    -0.57
     Metallic
    -0.56
    POSITIVE LOGITS
    doors
    1.20
    rage
    1.08
    dated
    1.07
    breaks
    1.03
    fitted
    1.01
    casts
    1.00
    landish
    0.99
    fits
    0.98
    come
    0.97
    raged
    0.97
    Act Density 0.049%

    No Known Activations