INDEX
    Explanations

    uncertainty

    words related to communication and information sharing.

    This neuron does not activate on any tokens and thus does not detect any pattern.

    New Auto-Interp
    Negative Logits
    ackages
    -0.07
    Pipe
    -0.06
    imagenes
    -0.06
     username
    -0.06
    (filepath
    -0.06
    arse
    -0.06
    чай
    -0.06
     hashtag
    -0.06
     handleClick
    -0.06
     streamline
    -0.06
    POSITIVE LOGITS
    369
    0.07
    .inspect
    0.07
    0.06
    937
    0.06
     نف
    0.06
     گذ
    0.06
     طلا
    0.06
    xEB
    0.06
    ์ช
    0.06
     करत
    0.06
    Act Density 0.001%

    No Known Activations