INDEX
Explanations
phrases indicating historical timelines or references to specific time periods
New Auto-Interp
Negative Logits
utures
-0.16
qus
-0.15
欣
-0.15
kra
-0.15
æĬ
-0.14
era
-0.14
endar
-0.14
.ht
-0.14
Sevent
-0.14
forth
-0.14
POSITIVE LOGITS
196
0.18
462
0.17
194
0.17
184
0.16
WWII
0.16
antiqu
0.16
198
0.15
197
0.14
eri
0.14
195
0.14
Activations Density 0.099%