INDEX
Explanations
references to parental relationships and family dynamics
New Auto-Interp
Negative Logits
cousin
-0.16
ugi
-0.15
colleague
-0.15
Cous
-0.15
Uncle
-0.14
ursal
-0.14
buddy
-0.14
817
-0.14
ampoo
-0.14
brethren
-0.14
POSITIVE LOGITS
parenting
0.23
loving
0.23
parent
0.23
family
0.21
nuclear
0.21
raising
0.21
biological
0.21
paren
0.20
parental
0.20
household
0.19
Activations Density 0.454%