INDEX
    Explanations

    translation localization

    This neuron activates on words and phrases related to translation or localization.

    New Auto-Interp
    Negative Logits
     Garten
    -0.06
     excuse
    -0.06
    udent
    -0.06
     sondern
    -0.06
    udiantes
    -0.06
    toggleClass
    -0.06
    compact
    -0.06
    chemes
    -0.06
    -0.06
    ene
    -0.06
    POSITIVE LOGITS
    해보
    0.07
    -multi
    0.07
    0.06
     Fixes
    0.06
     relocate
    0.06
    _CART
    0.06
    .quant
    0.06
     =="
    0.06
     kron
    0.06
    0.06
    Act Density 0.019%

    No Known Activations