INDEX
Explanations
references to women and gender-related issues
New Auto-Interp
Negative Logits
înc
-0.76
limus
-0.75
للاسماء
-0.73
Idy
-0.70
ValueGenerated
-0.70
pendium
-0.68
Dade
-0.67
Tare
-0.67
AndEndTag
-0.66
Darcy
-0.64
POSITIVE LOGITS
Women
1.21
women
1.20
woman
1.20
Woman
1.20
WOMAN
1.17
Women
1.13
WOMEN
1.11
WOMAN
1.11
women
1.10
Woman
1.10
Activations Density 0.048%