INDEX
    Explanations

    mentions of riots or riot-related activities

    instances of the word "riot" and its variations

    New Auto-Interp
    Negative Logits
    hered
    -0.68
    sonian
    -0.66
    omething
    -0.66
    DonaldTrump
    -0.65
    ledged
    -0.63
    ĻĤ
    -0.63
    ournal
    -0.63
    stellar
    -0.62
    ULTS
    -0.61
     Copy
    -0.61
    POSITIVE LOGITS
    ous
    1.08
    ously
    0.97
    ers
    0.95
    ing
    0.94
    rained
    0.91
     riot
    0.90
    osity
    0.86
    naire
    0.83
    eering
    0.81
     riots
    0.80
    Act Density 0.032%

    No Known Activations