INDEX
Explanations
references to women's rights and empowerment
New Auto-Interp
Negative Logits
Ethnic
-0.17
ethnic
-0.16
ethnic
-0.16
racially
-0.15
velt
-0.15
racial
-0.14
ethnicity
-0.14
sodom
-0.14
اÙĦÙħÙĪØ³
-0.14
ecz
-0.13
POSITIVE LOGITS
women
0.52
Women
0.45
Women
0.42
women
0.41
womens
0.36
woman
0.35
Womens
0.35
Woman
0.34
female
0.33
females
0.33
Activations Density 0.428%