INDEX
Explanations
phrases related to the age of individuals, particularly using the term "olds"
references to age groups or the term "olds."
New Auto-Interp
Negative Logits
SOURCE
-0.69
yss
-0.65
ophys
-0.64
ethics
-0.63
onyms
-0.62
arbit
-0.60
constitu
-0.59
NX
-0.58
adjunct
-0.58
EMS
-0.58
POSITIVE LOGITS
chool
1.12
olds
0.98
pring
0.98
warm
0.92
hips
0.89
velt
0.88
aurus
0.83
mith
0.82
hack
0.79
erers
0.79
Activations Density 0.016%