Explanations
phrases that mention ages or durations expressed in years
Negative Logits
ripple        -0.63
ulously       -0.63
illions       -0.60
ACTIONS       -0.59
iflower       -0.59
eus           -0.58
yrinth        -0.55
Dwell         -0.54
expansions    -0.54
contag        -0.54
Positive Logits
old           1.10
olds          1.07
ago           0.96
old           0.90
younger       0.89
Ago           0.84
Old           0.82
overdue       0.80
age           0.79
olds          0.76
Activations Density: 0.041%