INDEX
Explanations
news articles detailing incidents involving violence and casualties
references to fatalities and injuries resulting from violent incidents
New Auto-Interp
Negative Logits
Printed
-0.77
soDeliveryDate
-0.75
Cros
-0.71
omics
-0.71
chrome
-0.70
disclaimer
-0.70
SPONSORED
-0.69
200000
-0.66
Patreon
-0.66
vanity
-0.66
POSITIVE LOGITS
apiece
0.92
evacuated
0.88
injured
0.83
injuring
0.82
NYPD
0.82
reportedly
0.81
disembark
0.79
wounded
0.78
bery
0.77
wounding
0.76
Activations Density 0.302%