INDEX
Explanations
descriptions related to violent acts, specifically murder
references to the word "murdered."
Negative Logits
ffic      -0.81
rium      -0.79
aque      -0.79
Scot      -0.79
issue     -0.76
alter     -0.73
CLSID     -0.70
por       -0.69
gran      -0.69
Champ     -0.68
Positive Logits
murdered   0.99
slain      0.85
murdering  0.83
spree      0.83
murders    0.79
ÃįÃį       0.77
killings   0.76
nesday     0.76
adolesc    0.74
ãĥĥãĥī     0.73
Activations Density: 0.029%