    Explanations

    The neuron activates on occurrences of the first‐person pronoun “I.”

    Negative Logits
    Token            Logit
    "�示"            -0.07
    "Numbers"        -0.07
    " ftp"           -0.07
    "lx"             -0.07
    ".AreEqual"      -0.07
    ".fits"          -0.06
    "(filepath"      -0.06
    " BAT"           -0.06
    " shipping"      -0.06
    "_short"         -0.06

    Positive Logits
    Token            Logit
    " inform"        0.06
    " threatening"   0.06
    " Amer"          0.06
    "าค"             0.06
    " nikdo"         0.06
    "Hook"           0.06
    " Brittany"      0.06
    "IRC"            0.06
    "%M"             0.06
    "pekt"           0.06
    Activation Density: 0.015%

    No Known Activations