INDEX
Explanations
years in the 20th century
specific years and numerical data
New Auto-Interp
Negative Logits
otle
-0.81
light
-0.76
awed
-0.74
intosh
-0.74
sylvania
-0.73
ledged
-0.72
sonian
-0.72
anooga
-0.71
achu
-0.69
fall
-0.69
POSITIVE LOGITS
nd
1.93
ND
0.99
naire
0.80
thirds
0.76
ipop
0.74
ally
0.73
nces
0.71
80
0.70
ppo
0.70
147
0.68
Activations Density 0.152%