INDEX
Explanations
instances of sexual assault and related violent actions
New Auto-Interp
Negative Logits
issue
-0.32
immune
-0.32
issues
-0.32
hire
-0.30
fare
-0.30
repair
-0.30
tag
-0.30
fly
-0.30
gap
-0.30
bid
-0.29
POSITIVE LOGITS
Meredith
0.30
Samantha
0.30
prostitutes
0.29
nesday
0.28
TAMADRA
0.28
raping
0.28
agog
0.27
Paige
0.27
osexual
0.27
innoc
0.27
Activations Density 6.307%