INDEX
Explanations
phrases related to specific events or tragedies
mentions of significant events and entities related to terrorism and violence
Negative Logits
PROV      -0.74
Gw        -0.69
depend    -0.69
Jake      -0.65
PAX       -0.65
Balt      -0.64
LOAD      -0.62
POS       -0.62
UT        -0.62
ACTION    -0.60
Positive Logits
Hebdo     1.30
Massacre  0.78
tics      0.76
acher     0.75
anca      0.74
hiba      0.73
scl       0.72
Mansion   0.69
tsy       0.69
olini     0.69
Activations Density 0.048%