INDEX
Explanations
phrases related to women or women's issues
references to women and women's issues
New Auto-Interp
Negative Logits
uncle
-0.81
lished
-0.80
las
-0.72
odon
-0.72
rette
-0.72
uin
-0.71
nces
-0.71
hiba
-0.70
ills
-0.70
Quantity
-0.70
POSITIVE LOGITS
rights
0.96
liberation
0.83
restroom
0.82
empowerment
0.80
apparel
0.79
Rights
0.75
basketball
0.75
Liberation
0.74
rights
0.74
clubs
0.73
Activations Density 0.062%