INDEX
    Explanations

    The neuron activates on numeric tokens representing years (date references).

    New Auto-Interp
    Negative Logits
    edido
    -0.06
    中央
    -0.06
     Hind
    -0.06
    _glyph
    -0.06
     موضوع
    -0.06
    711
    -0.06
    enor
    -0.06
     tactical
    -0.06
    enh
    -0.06
     briefly
    -0.06
    POSITIVE LOGITS
    (dir
    0.07
     позвол
    0.07
     economic
    0.07
    \↵
    0.06
    ungi
    0.06
     decipher
    0.06
    *******
    ↵
    0.06
    auce
    0.06
    \\"
    0.06
     «
    0.06
    Act Density 0.014%

    No Known Activations