INDEX
Explanations
mentions of new babies and family additions
New Auto-Interp
Negative Logits
illos
-0.17
husbands
-0.15
divorced
-0.14
YRO
-0.14
erva
-0.14
коп
-0.14
_assoc
-0.14
име
-0.14
Cre
-0.14
idos
-0.13
POSITIVE LOGITS
baby
0.31
baby
0.25
Baby
0.25
bundle
0.23
Baby
0.22
BAB
0.21
bundles
0.20
little
0.20
bundle
0.20
babies
0.20
Activations Density 0.060%