INDEX
Explanations
mentions of parents
references to parents and their roles or influences
New Auto-Interp
Negative Logits
knife
-0.73
omaly
-0.72
cture
-0.72
machine
-0.65
Bench
-0.65
jam
-0.64
hole
-0.64
lore
-0.62
rep
-0.62
LP
-0.61
POSITIVE LOGITS
parents
3.73
Parents
2.98
Parents
2.92
parents
2.89
grandparents
2.44
mothers
2.41
moms
2.40
dads
2.32
parent
2.32
fathers
2.16
Activations Density 0.016%