INDEX
    Explanations

    references to years, particularly those related to events or accomplishments

    New Auto-Interp
    Negative Logits
    -seven
    -0.20
     seven
    -0.19
    -eight
    -0.18
    -nine
    -0.18
     seventh
    -0.17
    ptune
    -0.17
     nine
    -0.17
     eight
    -0.17
    seven
    -0.17
    -six
    -0.16
    POSITIVE LOGITS
    2
    0.27
    0
    0.27
    1
    0.23
    210
    0.20
    223
    0.20
    020
    0.19
    3
    0.18
    209
    0.18
    222
    0.18
    ï¼IJ
    0.18
    Act Density 0.041%

    No Known Activations