INDEX
Explanations
discussions about gender equality and women's representation in various fields
New Auto-Interp
Negative Logits
granddaughter
-0.23
daughter
-0.22
Daughter
-0.22
daughter
-0.22
доÑĩ
-0.18
panÃŃ
-0.18
Actress
-0.18
Mistress
-0.17
heroine
-0.17
niece
-0.17
POSITIVE LOGITS
men
1.16
males
0.91
male
0.88
Men
0.82
Männer
0.77
-men
0.77
Men
0.75
çĶ·æĢ§
0.72
çĶ·äºº
0.72
guys
0.71
Activations Density 0.564%