INDEX
Explanations
ages in years
references to age and the concept of being "old."
Negative Logits
coord    -0.66
phas     -0.65
otin     -0.62
Ts       -0.58
conduc   -0.57
anyl     -0.55
tnc      -0.54
Lanka    -0.54
amples   -0.53
HUD      -0.53
Positive Logits
faced       0.80
digy        0.78
adulthood   0.76
years       0.75
infancy     0.75
adult       0.72
reincarn    0.68
now         0.67
wiser       0.66
Adult       0.65
Activations Density 0.091%