INDEX
Explanations
references to gender differences in activities and preferences
about women
New Auto-Interp
Negative Logits
łbym
-0.89
łem
-0.73
אתה
-0.70
Grandpa
-0.69
brotherhood
-0.69
Grandpa
-0.68
grandpa
-0.66
łeś
-0.66
attore
-0.64
seines
-0.63
POSITIVE LOGITS
women
1.69
Women
1.58
Women
1.51
women
1.48
feminist
1.46
female
1.40
girls
1.39
Girls
1.34
womanhood
1.33
woman
1.33
Activations Density 1.190%