    Explanations

    This neuron responds to occurrences of the word “ignorance,” especially in the phrase “ignorance is bliss.”

    Negative Logits
     ساخته  -0.07
    سازی  -0.07
     acids  -0.07
     powerful  -0.07
     textual  -0.07
     nuclei  -0.07
     shaded  -0.07
    _classes  -0.06
    efa  -0.06
     Elena  -0.06
    Positive Logits
     ignorant  0.09
     unaware  0.07
    oge  0.07
     kilomet  0.07
    abay  0.07
     ignorance  0.06
     â  0.06
    operator  0.06
     dys  0.06
     arbitr  0.06
    Activation Density: 0.008%

    No Known Activations