INDEX
    Explanations

    references to nostalgia, anniversaries, and classic cultural works

    New Auto-Interp
    Negative Logits
    atest
    -0.16
     latest
    -0.15
     Newest
    -0.15
    /cms
    -0.14
    rong
    -0.14
     Latest
    -0.14
     delete
    -0.14
    ronic
    -0.14
     newest
    -0.14
     reconstruct
    -0.14
    POSITIVE LOGITS
     classic
    0.17
     classics
    0.17
    197
    0.17
    198
    0.16
    oldt
    0.16
     iconic
    0.15
    utow
    0.15
    classic
    0.15
     huku
    0.15
    zan
    0.14
    Act Density 0.293%

    No Known Activations