INDEX
    Explanations

    references to incidents involving law enforcement actions and violence

    New Auto-Interp
    Negative Logits
    lamaz
    -0.16
    ôme
    -0.15
    idges
    -0.14
    occo
    -0.14
    ughs
    -0.14
    WARDED
    -0.14
    ivas
    -0.14
     Typeface
    -0.13
    anded
    -0.13
    HORT
    -0.13
    POSITIVE LOGITS
    arer
    0.15
    æ¶ī
    0.15
     gerade
    0.15
    char
    0.14
     innocent
    0.14
    Popular
    0.14
    lee
    0.14
     ironically
    0.14
     Popular
    0.13
    nap
    0.13
    Act Density 0.142%

    No Known Activations