INDEX
Explanations
references to gender, particularly focusing on males and the construct of masculinity
New Auto-Interp
Negative Logits
guys
-0.25
Guys
-0.21
men
-0.20
boys
-0.19
plevel
-0.17
guy
-0.17
Boys
-0.17
Sisters
-0.17
ners
-0.17
çĶ·
-0.17
POSITIVE LOGITS
volent
0.41
-dominated
0.30
factor
0.28
fic
0.27
/f
0.25
-bodied
0.24
vol
0.23
uada
0.22
faction
0.21
bonding
0.21
Activations Density 0.020%