INDEX
Explanations
themes related to masculinity and men's health issues
New Auto-Interp
Negative Logits
herself
-0.18
ometown
-0.17
heroine
-0.16
ãĥ¼ãĥĸ
-0.16
lesbian
-0.15
ĵåIJį
-0.15
:async
-0.15
jejÃŃ
-0.15
å°ıå§IJ
-0.14
еÑij
-0.14
POSITIVE LOGITS
men
0.42
masculinity
0.41
males
0.41
male
0.40
masculine
0.36
mascul
0.35
çĶ·æĢ§
0.34
çĶ·åŃIJ
0.34
boys
0.34
Male
0.34
Activations Density 0.132%