INDEX
Explanations
references to specific years
New Auto-Interp
Negative Logits
069
-0.18
969
-0.18
596
-0.17
569
-0.17
067
-0.17
396
-0.16
670
-0.16
597
-0.16
674
-0.15
397
-0.15
POSITIVE LOGITS
8
0.32
eight
0.31
Eight
0.30
Eight
0.30
eight
0.29
VIII
0.28
Û¸
0.27
Eighth
0.26
18
0.25
८
0.25
Activations Density 0.034%