INDEX
    Explanations

    references to foundational figures and concepts in various fields of study

    New Auto-Interp
    Negative Logits
     lately
    -0.17
     newly
    -0.16
     recently
    -0.15
    æĹ§
    -0.15
    аблиÑĨ
    -0.14
    cko
    -0.14
     Newly
    -0.14
    latest
    -0.14
     older
    -0.14
     latest
    -0.14
    POSITIVE LOGITS
     modern
    0.28
     moderne
    0.23
    modern
    0.21
     Modern
    0.19
     idea
    0.19
    å¾Įãģ®
    0.19
     modem
    0.19
    Modern
    0.19
     concept
    0.18
     earliest
    0.18
    Act Density 0.210%

    No Known Activations