INDEX
    Explanations

    incidents involving crime and social justice issues

    New Auto-Interp
    Negative Logits
    achte
    -0.15
    Ù쨱
    -0.15
    achten
    -0.15
     inflict
    -0.14
    iddy
    -0.14
     iParam
    -0.14
    igne
    -0.13
    ichen
    -0.13
    loh
    -0.13
     Coin
    -0.13
    POSITIVE LOGITS
    eyin
    0.18
    cken
    0.16
    iro
    0.16
    -schema
    0.15
    apon
    0.15
    ussen
    0.14
    rar
    0.14
     Kund
    0.14
    zi
    0.14
     kaf
    0.14
    Act Density 0.232%

    No Known Activations