INDEX
Explanations
names starting or ending with 'old'
references to age or the concept of being old
New Auto-Interp
Negative Logits
sclerosis
-0.84
mathemat
-0.80
ILCS
-0.77
aminer
-0.75
onday
-0.72
millenn
-0.72
pload
-0.71
earchers
-0.70
utterstock
-0.69
Ĥİ
-0.69
POSITIVE LOGITS
rums
1.16
est
1.03
orf
0.89
sworth
0.88
face
0.86
ering
0.86
Trafford
0.85
fashioned
0.85
town
0.83
erer
0.81
Activations Density 0.028%