INDEX
Explanations
references to male individuals and their reproductive roles or experiences
New Auto-Interp
Negative Logits
herself
-0.92
protoimpl
-0.73
bint
-0.68
internetowa
-0.62
giggled
-0.60
脚注の使い方
-0.58
Kaur
-0.58
vicina
-0.57
herself
-0.57
rawDesc
-0.56
POSITIVE LOGITS
himself
1.34
himself
1.16
masculinity
1.03
manhood
1.03
Himself
0.92
boyhood
0.91
masculino
0.86
Males
0.83
manly
0.83
męski
0.83
Activations Density 0.819%