Explanations
mentions of women and gender-related topics
Negative Logits
Token      Logit
;width     -0.25
writer     -0.21
Woman      -0.20
woman      -0.20
,width     -0.20
(writer    -0.20
Womens     -0.19
writers    -0.19
wrists     -0.19
ولي        -0.19
Positive Logits
Token      Logit
unw        0.21
monthly    0.18
men        0.18
girls      0.17
Month      0.17
-linux     0.16
-month     0.16
months     0.16
Monthly    0.16
girl       0.16
Activations Density 0.089%
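
An activation density such as the figure above is commonly read as the fraction of token positions on which the feature fires. The following is a minimal sketch under that assumption; the function name `activation_density` and the `threshold` parameter are hypothetical and are not taken from the dashboard, which does not state how it computes the value.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: a feature counts as "active" on a token when its activation
    is strictly greater than `threshold` (zero by default).
    """
    if len(activations) == 0:
        raise ValueError("activations must not be empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# One active position out of four gives a density of 0.25 (25%).
sample = [0.0, 0.0, 0.0, 1.5]
print(f"{activation_density(sample):.2%}")
```

Under this reading, a density of 0.089% corresponds to roughly 89 active tokens in every 100,000.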