INDEX
    Explanations

    This neuron fires on numeric tokens, especially floating-point literals.

    Negative Logits
     спрос           -0.07
    ——               -0.06
    ことも           -0.06
     bourgeoisie     -0.06
                     -0.06
    ظ                -0.06
    kie              -0.06
    ---              -0.06
    shape            -0.06
    одерж            -0.06
    Positive Logits
    .ds              0.07
     ман             0.07
    DST              0.07
     RPG             0.06
     automatic       0.06
     Chic            0.06
    localStorage     0.06
     mensen          0.06
     Poz             0.06
     mobil           0.06
    Act Density 0.001%

    No Known Activations