INDEX
Explanations
references to violent acts or events, particularly those involving decapitation or execution
New Auto-Interp
Negative Logits
soType
-0.73
explan
-0.71
rium
-0.69
ZX
-0.69
CRE
-0.65
BuyableInstoreAndOnline
-0.64
easing
-0.62
paren
-0.62
Depth
-0.61
Import
-0.61
POSITIVE LOGITS
spree
0.87
corpses
0.83
murdered
0.83
murderers
0.81
murder
0.81
nikov
0.80
murders
0.76
murderer
0.75
death
0.75
victims
0.75
Activations Density 11.313%