INDEX
    Explanations

    The neuron activates on numerical tokens—especially decimal numbers—embedded in technical text.

    New Auto-Interp
    Negative Logits
     수상
    -0.07
    Formatted
    -0.06
     zaw
    -0.06
     Alone
    -0.06
    Enumerator
    -0.06
    GOR
    -0.06
     truyền
    -0.06
     پرس
    -0.06
     حذ
    -0.06
    šla
    -0.06
    POSITIVE LOGITS
    0.07
     диаг
    0.07
     tidak
    0.06
    мотря
    0.06
    Nos
    0.06
     DEAL
    0.06
    variable
    0.06
    _MED
    0.06
     adv
    0.06
    itos
    0.06
    Act Density 0.024%

    No Known Activations