INDEX
Explanations
references to instances of sexual violence or assault
references to sexual assault and abuse
New Auto-Interp
Negative Logits
quickShipAvailable
-0.96
Solitaire
-0.84
olin
-0.76
GOODMAN
-0.75
Dispatch
-0.73
arij
-0.67
Eff
-0.65
SourceFile
-0.65
VK
-0.65
Desk
-0.64
POSITIVE LOGITS
sexually
1.12
assaulted
0.97
Sexual
0.95
ejac
0.93
raped
0.91
oriented
0.90
transmitted
0.89
sexual
0.86
genital
0.86
attracted
0.83
Activations Density 0.006%