INDEX
Explanations
mentions of women in various contexts, suggesting a focus on gender-related content
references to women and women's issues
New Auto-Interp
Negative Logits
rador
-0.79
UFF
-0.78
asper
-0.76
REC
-0.75
REDACTED
-0.75
ype
-0.74
opher
-0.73
ellect
-0.71
OWN
-0.70
-+-+
-0.70
POSITIVE LOGITS
folk
1.14
empowerment
1.03
borg
0.90
breasts
0.88
contraceptive
0.87
genital
0.86
hood
0.84
menstru
0.84
opausal
0.83
genitals
0.83
Activations Density 0.058%