INDEX
    Explanations

    This neuron responds to first-person, self-referential expressions (e.g. “I’m,” “I’ve,” “don’t,” “my”) indicating personal feelings or states.

    New Auto-Interp
    Negative Logits
    official
    -0.07
     eden
    -0.07
     gallons
    -0.06
     розум
    -0.06
     \"%
    -0.06
    .getVersion
    -0.06
     cavern
    -0.06
    _UNSUPPORTED
    -0.06
    USTER
    -0.06
     officially
    -0.06
    POSITIVE LOGITS
    0.06
    خصص
    0.06
    بع
    0.06
     sclerosis
    0.06
    -custom
    0.06
     proportions
    0.06
    �다
    0.06
    brook
    0.06
    父亲
    0.06
    iddi
    0.06
    Act Density 0.075%

    No Known Activations