INDEX
    Explanations

    Code brackets

    This neuron activates on programming‐language syntax and identifiers (i.e. source‐code tokens) rather than natural‐language text.

    New Auto-Interp
    Negative Logits
    lif
    -0.06
    Downloader
    -0.06
    wash
    -0.06
    pres
    -0.06
     flavours
    -0.06
    vg
    -0.06
     rio
    -0.06
     мощ
    -0.06
    eres
    -0.05
    bios
    -0.05
    POSITIVE LOGITS
     Warren
    0.08
     AttributeSet
    0.07
     sum
    0.07
    نا
    0.07
     ему
    0.07
     添加
    0.07
     بد
    0.07
     taxable
    0.07
     میل
    0.07
    .resolve
    0.06
    Act Density 0.057%

    No Known Activations