Explanations
references to reproductive rights and family planning
Negative Logits
Lair     -0.16
Preis    -0.15
-alt     -0.15
Engel    -0.14
Hague    -0.14
         -0.14
Dup      -0.14
Neville  -0.14
Eh       -0.13
Levine   -0.13
Positive Logits
pill     0.16
fisse    0.15
edis     0.15
oped     0.15
ANI      0.15
ISCO     0.15
ked      0.15
omid     0.14
asher    0.14
руд      0.14
Activations Density 0.061%