INDEX
    Explanations

    references to terrorist attacks and military operations

    references to attacks or violence related to geopolitical events

    New Auto-Interp
    Negative Logits
    uid
    -0.91
    drawn
    -0.91
    galitarian
    -0.85
     wrinkles
    -0.83
    roo
    -0.81
    ãĤ´ãĥ³
    -0.80
    ripp
    -0.77
    igmat
    -0.75
    itus
    -0.75
    lied
    -0.75
    POSITIVE LOGITS
     civilians
    1.28
     unarmed
    1.10
     civilian
    1.07
     innocent
    1.04
     embassies
    1.04
     strongh
    0.97
     convoy
    0.96
     Kabul
    0.96
     targets
    0.95
     Gaza
    0.92
    Act Density 0.216%

    No Known Activations