INDEX
Explanations
phrases related to different forms of violence, specifically domestic violence
references to domestic violence and abuse
New Auto-Interp
Negative Logits
Reviewer
-0.86
Flag
-0.80
hart
-0.78
UMP
-0.75
isse
-0.73
jon
-0.71
Recipe
-0.70
DIT
-0.69
mand
-0.69
ISSION
-0.68
POSITIVE LOGITS
violence
1.06
abuse
0.97
prevention
0.91
abusers
0.89
homicides
0.87
Violence
0.86
offenders
0.85
gangs
0.83
abuse
0.82
homelessness
0.81
Activations Density 0.018%