INDEX
Explanations
references to gender, particularly male and female distinctions
the male gender
male and female distinctions
New Auto-Interp
Negative Logits
kasarigan
-0.98
itſelf
-0.91
intptr
-0.84
Plin
-0.82
myſelf
-0.82
Roskov
-0.81
صوتيه
-0.80
Athenians
-0.78
themſelves
-0.78
snippetHide
-0.77
POSITIVE LOGITS
volent
0.87
gender
0.81
male
0.78
Male
0.78
MALE
0.66
Male
0.61
gender
0.61
MALE
0.60
males
0.58
sex
0.56
Activations Density 0.093%