INDEX
    Explanations

    directional

    This neuron does not activate on any of the input tokens, indicating it is effectively inactive or unresponsive.

    New Auto-Interp
    Negative Logits
     manuscript
    -0.07
     fragmented
    -0.06
     Range
    -0.06
    rites
    -0.06
    Luke
    -0.06
     apart
    -0.06
    CellStyle
    -0.06
    -0.06
     Scout
    -0.05
     weg
    -0.05
    POSITIVE LOGITS
    irectional
    0.07
    로부터
    0.07
    čního
    0.07
     devastating
    0.07
     Takım
    0.06
    .transfer
    0.06
    ikt
    0.06
    0.06
    ßerdem
    0.06
     yılında
    0.06
    Act Density 0.001%

    No Known Activations