INDEX
Explanations
references to gender, particularly male and female distinctions
Negative Logits
Token               Logit
kasarigan           -0.99
Plin                -0.93
IVEREF              -0.85
Athenians           -0.82
للاسماء             -0.81
ujednoznacz         -0.81
FormTagHelper       -0.81
itſelf              -0.80
bounties            -0.79
setVerticalGroup    -0.78
Positive Logits
Token               Logit
volent              0.82
Male                0.74
male                0.69
MALE                0.62
gender              0.60
Male                0.58
MALE                0.56
males               0.55
èse                 0.51
asley               0.49
Activations Density: 0.119%