© Neuronpedia 2026

    Neuronpedia

    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability, from the paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models and context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
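    The TokenActivationPair strategy named above presents the explainer model with each token alongside its activation. The following is a minimal sketch of that idea, not the repository's actual code: the function names, the tab-separated layout, and the 0 to 10 integer scale are assumptions made for illustration.

```python
import math
from typing import List


def normalize_activations(
    activations: List[float], max_activation: float, scale: int = 10
) -> List[int]:
    """Map raw activations to integers in [0, scale].

    Assumption: negative activations are clamped to 0 and values are
    floored after scaling by the feature's maximum activation.
    """
    if max_activation <= 0:
        # A feature that never fires has nothing to scale against.
        return [0 for _ in activations]
    return [
        min(scale, math.floor(scale * max(0.0, a) / max_activation))
        for a in activations
    ]


def format_token_activation_pairs(
    tokens: List[str], activations: List[float], max_activation: float
) -> str:
    """Render one "token<TAB>activation" line per token (hypothetical layout)."""
    if len(tokens) != len(activations):
        raise ValueError("tokens and activations must have the same length")
    normalized = normalize_activations(activations, max_activation)
    return "\n".join(f"{tok}\t{act}" for tok, act in zip(tokens, normalized))


if __name__ == "__main__":
    # Illustrative values only; these are not real activations from any feature.
    print(format_token_activation_pairs(
        [" an", " impossible", " task"], [0.0, 8.0, 4.0], max_activation=8.0
    ))
```

    Discretizing to a small integer range keeps the prompt compact and spares the explainer model from reading long floating-point values, at the cost of losing fine-grained differences between weak activations.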
    Recent Explanations
    References to death, dying, end-of-life care, and hospice-related situations.
    gpt-5-mini
    Tuesday, surrounded by family, after a long battle with
    QWEN3-32B
    32-RESID-BATCHTOPK-65K
    INDEX 20857
    The neuron detects document metadata and boilerplate — headings, bylines/author lines, dates, copyright/contact info and other structural header elements.
    gpt-5-mini
    back to Miss World Philippines 2
    QWEN3-32B
    32-RESID-BATCHTOPK-65K
    INDEX 9084
    words that appear frequently at the beginning of sentences or after punctuation marks.
    claude-4-5-haiku
    back to Miss World Philippines 2
    QWEN3-32B
    32-RESID-BATCHTOPK-65K
    INDEX 51913
    This neuron detects high-frequency function words, especially the definite article "the" and forms of the verb "to be" (e.g., "is", "are").
    gpt-5-mini
    back to Miss World Philippines 2
    QWEN3-32B
    32-RESID-BATCHTOPK-65K
    INDEX 51913
    finds words and tokens expressing possibility or impossibility and the idea that something is difficult (e.g., "possible", "impossible", "task", "monstrous").
    gpt-5-mini
    realise this may well be considered an impossible task that's
    QWEN3-32B
    32-RESID-BATCHTOPK-65K
    INDEX 8161
    The neuron detects headings or section titles (tokens that appear in document or post titles/headers, often capitalized and near the start).
    gpt-5-mini
    Archives: Cart abandonment↵↵In our previous post
    QWEN3-32B
    32-RESID-BATCHTOPK-65K
    INDEX 37060
    class definitions and method signatures in code.
    claude-4-5-haiku
    ↵↵class JobManagerSpec extends AnyFunSuite {↵↵
    the special beginning-of-text marker indicating the start of a document or segment
    gpt-5
    <|begin_of_text|>test
    LLAMA3.1-8B
    18-LLAMASCOPE-RES-32K
    INDEX 12368
    markers of document starts and formatted headings/titles—especially quoted or all‑caps titles with adjacent punctuation like hyphens and periods.
    gpt-5
    <|begin_of_text|>test
    LLAMA3.1-8B
    18-LLAMASCOPE-RES-32K
    INDEX 3157
    the beginning of a text or document, as indicated by the start-of-text token.
    claude-4-5-sonnet
    <|begin_of_text|>Emma burned Mary's shirt
    LLAMA3.1-8B
    18-LLAMASCOPE-RES-32K
    INDEX 2618
    the beginning of documents and formal titles or honorifics (like "Mr.") preceding proper names.
    claude-4-5-sonnet
    <|begin_of_text|>test
    LLAMA3.1-8B
    18-LLAMASCOPE-RES-32K
    INDEX 3157
    commas in machine-generated or low-quality text.
    claude-4-5-sonnet
    with absolutely nothing but harm, charges, and no help
    Looking at the activations, this neuron activates strongly on: - Possessive constructions with apostrophes (Village Clerk**'s**, driver**'s**, owner**'s**) - References to specific locations/jurisdictions (Village of Hempstead
    claude-4-5-sonnet
    higher conference.↵↵It’s amazing how good of
    non-English text, particularly Greek and other European languages.
    claude-4-5-sonnet
    Σε κανένα στάδιο της διαδικασίας η εται
    narrative turning points and transitions where key events or actions occur in a story.
    claude-4-5-haiku
      Managing investment portfolios for individuals or institutions. (Requires
    The neuron is essentially inactive and does not respond to any token—it finds nothing.
    o4-mini
     in its pathogenesis, namely, serotonin, glutamate, nore
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 68877
    The neuron detects long runs of whitespace (indentation) in code.
    o4-mini
     enumMapper() {↵        EnumMapper enumMapper =
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 75628
    The neuron lights up on general-purpose discourse or stance markers—common evaluative or filler words and phrases like “OK,” “fine,” “absolutely,” “that,” “it,” and “was.”
    o4-mini
    .↵↵And that is OK
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 1729
    The neuron fires on technical terms referring to optical or color‐spectral properties (e.g. spectral characteristics, color filters, wavelengths, colored light).
    o4-mini
     are generally provided with spectral filters for the three colors,
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 59871
    The neuron responds to comparative constructions (e.g. “as … as”, “more … than”) that set up an explicit comparison between two items.
    o4-mini
     to sickness are as important as schemes designed
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 47558
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 6219
    GEMMA-3-27B
    16-GEMMASCOPE-2-RES-16K
    INDEX 174
    GEMMA-3-27B
    16-GEMMASCOPE-2-RES-16K
    INDEX 15232
    GEMMA-3-27B
    40-GEMMASCOPE-2-RES-16K
    INDEX 11837
    GEMMA-3-27B-IT
    41-GEMMASCOPE-2-RES-262K
    INDEX 299