INDEX
Explanations
statements related to controversial or scandalous behavior
derogatory terms and inflammatory statements aimed at individuals or groups
New Auto-Interp
Negative Logits
partName
-0.69
Printed
-0.62
anecd
-0.60
cellaneous
-0.59
icion
-0.57
Quarterly
-0.57
unfocusedRange
-0.57
confir
-0.57
aback
-0.57
misunder
-0.56
POSITIVE LOGITS
raping
0.88
raped
0.77
rape
0.76
rapist
0.70
sex
0.70
adultery
0.69
vagina
0.69
genitals
0.69
sexually
0.68
rapists
0.67
Activations Density 1.763%