INDEX
    Explanations

    This neuron never activates for any token—it doesn’t respond to any pattern (i.e. it’s essentially “dead”).

    New Auto-Interp
    Negative Logits
     pathogens
    -0.07
    vit
    -0.06
    파일
    -0.06
     BigDecimal
    -0.06
    ٌ
    -0.06
    olarity
    -0.06
    CLU
    -0.06
    .IndexOf
    -0.06
    Shopping
    -0.06
    romo
    -0.06
    POSITIVE LOGITS
     sister
    0.07
     Delhi
    0.07
    define
    0.07
     Rotation
    0.07
    size
    0.06
     zvyš
    0.06
     Auto
    0.06
     halls
    0.06
     shorter
    0.06
    )}</
    0.06
    Act Density 0.025%

    No Known Activations