INDEX
    Explanations

    Programming code and data

    This neuron activates on numeric literal tokens—especially floating‐point numbers—wherever they appear in the code.

    New Auto-Interp
    Negative Logits
    literal
    -0.08
    titulo
    -0.07
    .abstract
    -0.07
    -0.07
    arda
    -0.07
     Gat
    -0.06
    Space
    -0.06
    .Fl
    -0.06
     Raptors
    -0.06
     %}↵
    -0.06
    POSITIVE LOGITS
    INGTON
    0.06
    ملة
    0.06
     landing
    0.06
     li
    0.06
     catast
    0.06
    ेब
    0.06
    earning
    0.06
     threatens
    0.06
    0.06
     getPosition
    0.05
    Act Density 0.014%

    No Known Activations