INDEX
Explanations
references to male health and wellness initiatives
New Auto-Interp
Negative Logits
å°ıå§IJ
-0.22
lesbian
-0.20
Actress
-0.20
granddaughter
-0.18
å§IJ
-0.17
å§ij
-0.17
heroine
-0.17
Lesbian
-0.17
herself
-0.17
convent
-0.16
POSITIVE LOGITS
masculinity
0.51
men
0.48
guys
0.44
masculine
0.43
testosterone
0.42
male
0.41
males
0.41
boys
0.40
mascul
0.38
çĶ·åŃIJ
0.38
Activations Density 0.233%