INDEX
Explanations
years mentioned in the text
New Auto-Interp
Negative Logits
achus
-0.69
chat
-0.65
HUD
-0.64
subp
-0.64
akin
-0.61
chy
-0.61
peak
-0.61
Apps
-0.58
illions
-0.57
perse
-0.57
POSITIVE LOGITS
olds
0.90
old
0.89
old
0.87
anniversary
0.83
ago
0.79
olds
0.78
OLD
0.69
Ago
0.68
veteran
0.68
Anniversary
0.67
Activations Density 0.031%