INDEX
Explanations
mentions of ages
references to age-related milestones or events
New Auto-Interp
Negative Logits
pard
-0.80
phis
-0.80
vernment
-0.78
alle
-0.74
eries
-0.70
endas
-0.67
alties
-0.66
ancial
-0.66
inis
-0.65
DCS
-0.64
POSITIVE LOGITS
Age
0.83
age
0.81
18
0.80
nineteen
0.79
adulthood
0.77
ripe
0.74
FontSize
0.73
19
0.73
21
0.72
puberty
0.71
Activations Density 0.021%