INDEX
    Explanations

    This neuron activates on numeric tokens, particularly decimal numbers.

    New Auto-Interp
    Negative Logits
     />,↵
    -0.07
    nces
    -0.06
    _fun
    -0.06
     QPointF
    -0.06
     incel
    -0.06
    าของ
    -0.06
     plat
    -0.06
     konkrét
    -0.06
    Ans
    -0.06
    	fflush
    -0.06
    POSITIVE LOGITS
     din
    0.08
     파일
    0.07
     toxicity
    0.07
     whisper
    0.07
    –and
    0.07
    assist
    0.07
     Contributions
    0.06
     Packaging
    0.06
    .Part
    0.06
     ####
    0.06
    Act Density 0.032%

    No Known Activations