Explanations
references to gender, particularly male-related terms and roles
Negative Logits
setVerticalGroup    -0.76
AutoScaleMode       -0.44
Sklici              -0.40
ISNI                -0.40
ExecuteReader       -0.39
Groetjes            -0.37
viewBox             -0.36
CascadeType         -0.36
externi             -0.34
تفصیلات             -0.34
Positive Logits
manly               0.78
himself             0.77
masculino           0.75
męski               0.73
masculinity         0.71
mascul              0.71
masculinos          0.71
مردانه              0.68
masculina           0.68
Mascul              0.68
Activations Density 1.157%