INDEX
    Explanations

    math problems

    This neuron activates on phrases referring to marginal probability density functions.

    New Auto-Interp
    Negative Logits
    -primary
    -0.07
    _games
    -0.06
     Mormons
    -0.06
     Monday
    -0.06
    Samples
    -0.06
     DESC
    -0.06
    Campo
    -0.06
    Add
    -0.06
     siblings
    -0.06
    -0.06
    POSITIVE LOGITS
     غذ
    0.06
     pomoc
    0.06
    上传
    0.06
     tout
    0.06
    نة
    0.06
    .YELLOW
    0.06
    _DL
    0.06
    '=>
    0.06
    τηκε
    0.06
     về
    0.06
    Act Density 0.023%

    No Known Activations