INDEX
    Explanations

    The neuron activates on past-tense verb tokens (e.g., “created,” “built,” “generated,” “made”).

    New Auto-Interp
    Negative Logits
     Sud
    -0.06
     trials
    -0.06
     prominent
    -0.06
     bdsm
    -0.06
     commanders
    -0.06
    ashion
    -0.06
    -filter
    -0.06
     parece
    -0.06
    .mime
    -0.06
    -ring
    -0.06
    POSITIVE LOGITS
    .Col
    0.07
     mẽ
    0.07
    ”?
    0.07
     Ã
    0.07
     तरफ
    0.06
    .")
    0.06
     swirling
    0.06
    ية
    0.06
    ')==
    0.06
    .'"
    0.06
    Act Density 0.031%

    No Known Activations