INDEX
Explanations
gender-specific words related to roles of authority, particularly focusing on the presence of women in leadership positions
references to managers
New Auto-Interp
Negative Logits
teenth
-0.73
DonaldTrump
-0.69
INGTON
-0.68
FER
-0.67
çīĪ
-0.66
SPONSORED
-0.65
READ
-0.65
ENC
-0.64
cale
-0.62
lihood
-0.61
POSITIVE LOGITS
ial
1.05
ials
0.93
agers
0.77
iola
0.76
hips
0.76
icky
0.76
ubs
0.75
anova
0.74
ially
0.73
onym
0.71
Activations Density 0.042%