INDEX
Explanations
mentions of women or gender-related topics
references to women and gender issues
New Auto-Interp
Negative Logits
REDACTED
-0.86
opher
-0.76
asper
-0.76
anoia
-0.75
ebus
-0.75
rador
-0.75
eme
-0.74
ype
-0.72
RAY
-0.72
UFF
-0.71
POSITIVE LOGITS
folk
1.21
empowerment
1.07
genital
1.00
reproductive
0.92
opausal
0.88
breasts
0.87
menstru
0.87
genitals
0.86
entrepreneurs
0.81
nipples
0.80
Activations Density 0.086%