INDEX
    Explanations

    references to military or police operations

    references to "raids" or related concepts

    New Auto-Interp
    Negative Logits
     warmed
    -0.68
    milo
    -0.64
    chell
    -0.63
    DonaldTrump
    -0.63
     Leilan
    -0.62
    assetsadobe
    -0.60
    ogyn
    -0.60
    Ian
    -0.59
     Hurricanes
    -0.58
    issues
    -0.56
    POSITIVE LOGITS
     raid
    0.95
     raids
    0.90
     raided
    0.84
    ers
    0.84
    ishly
    0.82
    nard
    0.81
     raiding
    0.76
    iversary
    0.75
    artment
    0.75
     netted
    0.73
    Act Density 0.038%

    No Known Activations