INDEX
Explanations
phrases related to violent acts resulting in death
references to violent deaths and murder
New Auto-Interp
Negative Logits
æ©Ł
-0.81
inventoryQuantity
-0.75
ĨĴ
-0.73
rium
-0.71
Factor
-0.71
soType
-0.69
iago
-0.66
annis
-0.66
omaly
-0.66
Seller
-0.65
POSITIVE LOGITS
unarmed
0.96
murdering
0.92
murdered
0.81
senseless
0.80
rampage
0.78
murder
0.78
indiscrim
0.76
gunned
0.76
spree
0.75
revenge
0.75
Activations Density 0.135%