Explanations
references to years, particularly those related to events or accomplishments
Negative Logits
Token      Logit
-seven     -0.20
seven      -0.19
-eight     -0.18
-nine      -0.18
seventh    -0.17
ptune      -0.17
nine       -0.17
eight      -0.17
seven      -0.17
-six       -0.16
Positive Logits
Token      Logit
2          0.27
0          0.27
1          0.23
210        0.20
223        0.20
020        0.19
3          0.18
209        0.18
222        0.18
ï¼IJ       0.18
Activations Density 0.041%
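
The page does not say how the density figure is computed. A minimal sketch, assuming it is the percentage of sampled token positions at which the feature's activation is above zero; the function name, the threshold parameter, and the sample data are all hypothetical:

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of positions whose activation exceeds threshold.

    Assumption: density means "share of token positions where the feature
    fires"; the dashboard itself does not define the metric.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical data: 10,000 token positions, 4 of which activate the feature.
sample = [0.0] * 9996 + [1.3, 0.7, 2.1, 0.2]
print(f"{activation_density(sample):.3f}%")
```

Under this reading, a density of 0.041% would mean the feature fires on roughly 4 of every 10,000 tokens.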