INDEX
    Explanations

    The neuron activates on numerical tokens formatted as decimal (floating-point) numbers.

    New Auto-Interp
    Negative Logits
    nge
    -0.06
    ۱۱
    -0.06
     fairy
    -0.06
    veral
    -0.06
    ображ
    -0.06
     eleven
    -0.06
    Experimental
    -0.06
    xe
    -0.06
    _disk
    -0.06
    math
    -0.06
    POSITIVE LOGITS
     kosher
    0.07
     caption
    0.06
     modulo
    0.06
    mens
    0.06
     assignable
    0.06
     transformer
    0.06
     named
    0.06
    是一个
    0.06
     nacional
    0.06
     شر
    0.06
    Act Density 0.034%

    No Known Activations