Explanations
data related to gender differences in various contexts
Negative Logits
nsito        -0.46
Lorraine     -0.45
ValueStyle   -0.45
NOOP         -0.43
oscura       -0.42
grandma      -0.42
múl          -0.42
Deak         -0.42
grandmother  -0.42
useStyles    -0.41
Positive Logits
gender   1.49
sexes    1.28
sex      1.27
gender   1.22
Gender   1.19
genders  1.19
sexe     1.19
Gender   1.13
sex      1.08
Sex      1.07
Activations Density: 0.316%