INDEX
Explanations
terms and concepts related to gender equality and women's empowerment
Negative Logits
Pazar       -0.16
racial      -0.16
merican     -0.15
american    -0.14
Gerr        -0.14
ETH         -0.14
entiful     -0.14
allet       -0.14
American    -0.14
559         -0.14
Positive Logits
UN          0.19
.UN         0.18
UN          0.18
violence    0.18
rights      0.17
津          0.17
203         0.17
girls       0.17
intersect   0.16
-viol       0.16
Activations Density 0.112%