Explanations
descriptions of individuals based on age
references to age, specifically using the term "old."
Negative Logits
Token       Logit
ichick      -0.97
TIME        -0.92
rosis       -0.88
vernment    -0.82
icter       -0.80
SHIP        -0.78
acly        -0.77
entimes     -0.77
gone        -0.77
oldown      -0.76
Positive Logits
Token       Logit
freshman    1.06
sophomore   1.04
student     0.96
boy         0.90
Frenchman   0.90
Nigerian    0.89
pupil       0.88
Mississ     0.86
graduate    0.86
girl        0.86
Activations Density: 0.060%