INDEX
Explanations
male and female gender-specific terms
references to male and female identities or roles
Negative Logits
èª
-0.82
ENCY
-0.71
ģĸ
-0.71
ENTS
-0.69
OPS
-0.68
ears
-0.67
{{
-0.65
NRS
-0.65
ãĥĦ
-0.64
î
-0.64
Positive Logits
volent
1.25
Male
1.15
Female
0.94
Males
0.79
vich
0.79
uscript
0.73
ering
0.73
genital
0.73
mating
0.68
Ath
0.65
Activations Density: 0.007%