INDEX
Explanations
incidents involving death and destruction
phrases related to violent events and casualties
New Auto-Interp
Negative Logits
abase
-0.76
Marketable
-0.71
Invention
-0.67
wcsstore
-0.64
vanity
-0.63
Framework
-0.61
TPP
-0.60
heit
-0.60
Ide
-0.60
minecraft
-0.59
POSITIVE LOGITS
injuring
0.86
bystanders
0.81
evac
0.78
rench
0.76
gunmen
0.75
casualties
0.75
ousands
0.73
victims
0.72
inflamm
0.72
wounding
0.72
Activations Density 0.182%