INDEX
Explanations
references to criminal activities, particularly focusing on incidents involving crime and violence
references to homicide or related incidents
Negative Logits
stals     -0.69
Tokens    -0.67
oga       -0.65
stream    -0.65
tune      -0.63
yn        -0.63
bler      -0.62
qual      -0.61
Appl      -0.61
Allows    -0.60
Positive Logits
homicide        3.82
homicides       2.76
manslaughter    1.98
murder          1.84
murders         1.61
crime           1.49
suicide         1.46
burglary        1.45
killings        1.42
Murder          1.39
Activations Density 0.008%
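The density figure above is most naturally read as the fraction of sampled token positions on which the feature fires. The sketch below illustrates that reading only; the function name `activation_density`, the zero threshold, and the sample data are assumptions made for illustration, not the dashboard's actual computation.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature is active, with "active" defined as activation > threshold.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 8 of 100,000 token positions.
sample = [0.0] * 99_992 + [1.5] * 8
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.008%"
```

Under this reading, a density of 0.008% corresponds to roughly 8 active positions per 100,000 tokens, which is consistent with a feature tied to a narrow vocabulary such as homicide-related terms.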