INDEX
Negative Logits
feminist
-0.12
niñas
-0.11
motherhood
-0.11
сказала
-0.11
witches
-0.10
שאת
-0.10
dolls
-0.10
mujeres
-0.10
feminism
-0.10
güz
-0.10
POSITIVE LOGITS
beard
0.11
testosterone
0.11
dads
0.10
macho
0.10
masculino
0.10
俺
0.10
barbe
0.09
男性
0.09
pria
0.09
mascul
0.09
Activations Density 0.246%