INDEX
Explanations
phrases related to discrimination and violence against vulnerable groups
references to discrimination and violence against various groups
New Auto-Interp
Negative Logits
soDeliveryDate
-1.07
ERN
-0.89
natureconservancy
-0.82
hower
-0.80
orah
-0.79
chell
-0.74
Cosponsors
-0.71
}}}
-0.71
oppy
-0.70
psons
-0.69
POSITIVE LOGITS
soever
0.87
civilians
0.85
innocent
0.81
humanity
0.78
unborn
0.78
barbar
0.76
prostitutes
0.75
harming
0.74
women
0.74
oppression
0.72
Activations Density 0.075%