INDEX
    Explanations

    references to violent crimes and incidents

    New Auto-Interp
    Negative Logits
     Numerade
    -0.66
    TagMode
    -0.61
     plaisir
    -0.57
     كومونز
    -0.52
     سكانية
    -0.52
     Biar
    -0.50
    internalType
    -0.49
     adop
    -0.49
     Miscell
    -0.49
    ее
    -0.48
    POSITIVE LOGITS
     attacks
    0.77
     attack
    0.76
     robbery
    0.73
    setViewportView
    0.66
    Attacks
    0.65
     robberies
    0.62
     crime
    0.60
     serangan
    0.59
     robbers
    0.59
    attacks
    0.59
    Act Density 0.262%

    No Known Activations