INDEX
Explanations
mentions of different forms of sexual assault, including allegations, reports, and discussions
terms related to sexual assault and harassment
New Auto-Interp
Negative Logits
peed
-0.75
IPS
-0.74
peror
-0.74
WARD
-0.68
AY
-0.68
AIR
-0.68
æ°
-0.67
KC
-0.67
XM
-0.65
ãĤ©
-0.65
POSITIVE LOGITS
Sexual
0.96
Rape
0.90
allegations
0.87
rape
0.86
perpetrated
0.83
victims
0.79
raped
0.79
sexually
0.79
abuse
0.79
allegation
0.78
Activations Density 0.069%