INDEX
    Explanations

    This neuron activates on mentions of the “meaning of life, the universe, and everything” phrase.

    New Auto-Interp
    Negative Logits
     соб
    -0.07
     Sponsor
    -0.07
     Welfare
    -0.07
    pch
    -0.07
     základě
    -0.06
     одной
    -0.06
    ování
    -0.06
    ову
    -0.06
     darkness
    -0.06
     tách
    -0.06
    POSITIVE LOGITS
     provoked
    0.07
     Baş
    0.07
     اللغة
    0.07
    celand
    0.07
    Average
    0.07
    ціон
    0.06
     пак
    0.06
    latable
    0.06
     Romania
    0.06
     gắn
    0.06
    Act Density 0.002%

    No Known Activations