INDEX
Explanations
references to marital relationships and the roles of wives and husbands
New Auto-Interp
Negative Logits
ichel
-0.16
ives
-0.15
males
-0.15
weg
-0.15
majority
-0.15
ibus
-0.14
presidency
-0.14
ithe
-0.14
bject
-0.14
ummings
-0.14
POSITIVE LOGITS
/part
0.27
hood
0.26
-wife
0.21
-to
0.19
prene
0.19
Ñĩина
0.18
elijke
0.17
/sign
0.17
-child
0.17
maids
0.17
Activations Density 0.056%