INDEX
Explanations
mentions of violent crimes, particularly sexual assault and torture
references to violent crimes and abuses, particularly sexual violence
New Auto-Interp
Negative Logits
angelo
-0.78
hower
-0.75
natureconservancy
-0.71
heast
-0.71
Classic
-0.70
Ahead
-0.69
Balls
-0.69
Horizons
-0.68
eus
-0.68
Minutes
-0.68
POSITIVE LOGITS
rape
1.50
torture
1.41
murder
1.33
kidnapping
1.33
rapes
1.31
arson
1.31
intimidation
1.28
robbery
1.28
kidn
1.27
imprisonment
1.27
Activations Density 0.158%