    Explanations

    terms related to riots and violence

    Negative Logits
    Token               Logit
    "<bos>"             -0.83
    " boop"             -0.81
    " affez"            -0.80
    " sento"            -0.80
    " luigi"            -0.80
    " trovo"            -0.79
    " logitech"         -0.76
    " imgur"            -0.76
    " hasbro"           -0.76
    " wikihow"          -0.75

    Positive Logits
    Token               Logit
    " riots"            0.80
    " riot"             0.69
    " unrest"           0.61
    " violence"         0.59
    " uprising"         0.58
    " erupted"          0.55
    "riot"              0.53
    " disturbances"     0.52
    " demonstrations"   0.51
    " Riot"             0.51

    Activation Density: 0.307%

    No Known Activations