INDEX
Explanations
discussions around gender roles and representation in society, particularly in STEM fields and traditional expectations
New Auto-Interp
Negative Logits
fk
-0.15
pupper
-0.14
ardy
-0.14
ngo
-0.13
xin
-0.13
غÙĨ
-0.13
uhe
-0.13
ạ
-0.13
erview
-0.13
Parent
-0.12
POSITIVE LOGITS
male
0.97
males
0.95
men
0.89
çĶ·
0.75
male
0.75
Male
0.74
çĶ·æĢ§
0.73
çĶ·
0.71
Male
0.69
boys
0.68
Activations Density 0.639%