INDEX
Explanations
mentions of individuals or groups responsible for committing varying acts
terms related to individuals who commit crimes or harmful acts
New Auto-Interp
Negative Logits
psey
-0.83
zl
-0.66
aver
-0.65
eem
-0.65
paio
-0.65
ria
-0.65
opian
-0.64
Plat
-0.64
ARY
-0.64
rients
-0.62
POSITIVE LOGITS
perpetrated
1.03
perpetrators
0.92
perpetrator
0.90
offenders
0.83
offender
0.80
abused
0.77
abuse
0.76
Abuse
0.75
spree
0.73
Victims
0.73
Activations Density 0.011%