INDEX
Explanations
descriptions related to specific age groups, particularly focusing on infants and toddlers
phrases indicating the age and gender of children
New Auto-Interp
Negative Logits
anwhile
-0.96
tremend
-0.79
hement
-0.77
headers
-0.74
vernment
-0.73
prosec
-0.73
kefeller
-0.72
papers
-0.70
chwitz
-0.70
dep
-0.68
POSITIVE LOGITS
boy
0.95
daughter
0.93
girl
0.89
grandson
0.88
boys
0.87
son
0.86
toddler
0.83
children
0.82
girls
0.81
granddaughter
0.80
Activations Density 0.056%