INDEX
Explanations
references to violent actions resulting in harm or death
references to violent deaths and fatalities
New Auto-Interp
Negative Logits
wcsstore
-0.90
channelAvailability
-0.82
hist
-0.77
DragonMagazine
-0.76
soType
-0.74
arist
-0.72
deleg
-0.70
DIR
-0.70
Han
-0.67
hatt
-0.66
POSITIVE LOGITS
spree
0.94
rampage
0.80
gunshot
0.78
Sandra
0.76
stabbing
0.75
nsics
0.74
llah
0.73
psychiat
0.72
manslaughter
0.72
Trayvon
0.71
Activations Density 0.083%