INDEX
Explanations
topics related to gender equality and women's representation
New Auto-Interp
Negative Logits
taste
-0.29
حب
-0.26
couvrez
-0.25
TextWatcher
-0.25
Blut
-0.24
NEXT
-0.24
păr
-0.24
雀
-0.24
Einf
-0.24
indah
-0.24
POSITIVE LOGITS
Women
1.00
Women
0.99
women
0.96
féminine
0.93
WOMEN
0.93
Gender
0.90
women
0.89
gender
0.87
Gender
0.87
feminine
0.87
Activations Density 0.553%