Explanations
words and phrases related to familial relationships, particularly involving daughters
his daughter
Negative Logits
Token     Logit
mother    -1.01
Mother    -0.98
Mother    -0.96
mothers   -0.95
mother    -0.93
Mrs       -0.90
mom       -0.88
Mrs       -0.85
MOTHER    -0.85
Mom       -0.84
Positive Logits
Token     Logit
girl      1.50
girls     1.24
girl      1.20
Girl      1.16
GIRL      1.12
Girl      1.10
Girls     1.03
GIRLS     1.03
girls     0.99
Girls     0.99
Activations Density: 0.133%