INDEX
Explanations
references to women and gender equality
New Auto-Interp
Negative Logits
Woman
-0.23
Womens
-0.21
;width
-0.21
woman
-0.21
Women
-0.20
Woman
-0.20
Boy
-0.19
ÙĪÙĦÙĬ
-0.19
wrists
-0.18
女人
-0.18
POSITIVE LOGITS
men
0.24
unw
0.21
men
0.21
monthly
0.20
girls
0.19
-men
0.18
Men
0.17
Month
0.17
Monthly
0.17
Monthly
0.17
Activations Density 0.068%