INDEX
Explanations
mentions of crimes like murder
instances of the word "murder."
New Auto-Interp
Negative Logits
Cola
-0.80
wcsstore
-0.76
ais
-0.73
UTC
-0.73
BuyableInstoreAndOnline
-0.70
Dub
-0.68
imus
-0.67
uve
-0.67
ube
-0.66
arity
-0.65
POSITIVE LOGITS
spree
1.15
murder
1.03
murders
0.93
ously
0.90
homicide
0.89
hyde
0.87
rampage
0.86
murderer
0.86
murdering
0.85
Murder
0.84
Activations Density 0.031%