INDEX
Explanations
phrases related to women's issues and rights
references to women's issues and representation
New Auto-Interp
Negative Logits
USS
-0.81
uin
-0.74
Flavoring
-0.71
OUGH
-0.70
ondo
-0.69
æ©Ł
-0.69
Tough
-0.68
ebus
-0.67
à¼
-0.66
leased
-0.66
POSITIVE LOGITS
breasts
0.98
childbirth
0.95
herself
0.90
empowerment
0.89
lesbian
0.86
skirts
0.84
breastfeeding
0.83
wom
0.81
hijab
0.81
boyfriend
0.80
Activations Density 0.272%