INDEX
Explanations
references to children and parenting dynamics
parenting or childhood
children's actions or parental guidance
New Auto-Interp
Negative Logits
uomini
-0.61
mannen
-0.57
chồng
-0.57
vrouwen
-0.54
urismo
-0.53
uomo
-0.53
retirees
-0.53
زن
-0.52
wanita
-0.52
cưới
-0.52
POSITIVE LOGITS
parents
1.08
🧒
1.05
Parents
1.04
adults
0.98
PARENTS
0.96
parental
0.96
Parents
0.96
Parental
0.92
preschool
0.91
parent
0.90
Activations Density 0.562%