INDEX
Explanations
ages or age-related information
mentions of specific age groups
Negative Logits
eln         -0.85
Bundy       -0.80
atl         -0.71
rog         -0.70
lar         -0.70
DCS         -0.68
vernment    -0.67
atari       -0.65
gotten      -0.64
chief       -0.63
Positive Logits
age         0.85
ages        0.77
Age         0.75
liest       0.74
angering    0.73
cohorts     0.73
omething    0.71
checks      0.71
olds        0.70
Wallet      0.70
Activations Density 0.035%