INDEX
Explanations
references to terrorist attacks and military operations
references to attacks or violence related to geopolitical events
Negative Logits
uid            -0.91
drawn          -0.91
galitarian     -0.85
wrinkles       -0.83
roo            -0.81
ãĤ´ãĥ³         -0.80
ripp           -0.77
igmat          -0.75
itus           -0.75
lied           -0.75
Positive Logits
civilians      1.28
unarmed        1.10
civilian       1.07
innocent       1.04
embassies      1.04
strongh        0.97
convoy         0.96
Kabul          0.96
targets        0.95
Gaza           0.92
Activations Density: 0.216%
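
As a rough illustration of how a density figure like this could be read, the sketch below assumes that density means the fraction of sampled token positions on which the feature's activation is above zero. That definition, the function name `activation_density`, and the sample values are assumptions made for illustration; none of them come from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumed definition: the dashboard does not state how density is computed.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1000 token positions, the feature fires on 2 of them.
sample = [0.0] * 998 + [3.2, 0.7]
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.200%"
```

Under this reading, a density of 0.216% would mean the feature is active on roughly 2 out of every 1000 tokens, which would fit a feature tied to a narrow topic.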