© Neuronpedia 2026

    Neuronpedia

    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability method, from the paper "Language models can explain neurons in language models". Modified by Johnny Lin to support new models and context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, using the TokenActivationPair strategy.
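    The page names the TokenActivationPair strategy without showing what the explainer model sees. A minimal sketch follows, assuming each token is paired with its activation, rescaled to an integer on a 0 to 10 scale, and rendered one pair per line separated by a tab. The function names `discretize` and `format_pairs` are hypothetical and are not taken from the linked repository; the exact prompt layout there may differ.

```python
from typing import List, Sequence

MAX_LEVEL = 10  # assumed top of the discretized activation scale


def discretize(activations: Sequence[float], max_activation: float) -> List[int]:
    """Clamp negative activations to zero and rescale to integers in [0, MAX_LEVEL]."""
    if max_activation <= 0:
        # Nothing fired: every token gets level 0.
        return [0 for _ in activations]
    return [
        min(MAX_LEVEL, int(MAX_LEVEL * max(0.0, a) / max_activation))
        for a in activations
    ]


def format_pairs(tokens: Sequence[str], activations: Sequence[float]) -> str:
    """Render one 'token<TAB>level' line per token (hypothetical layout)."""
    if len(tokens) != len(activations):
        raise ValueError("tokens and activations must have the same length")
    levels = discretize(activations, max(activations, default=0.0))
    return "\n".join(f"{tok}\t{lvl}" for tok, lvl in zip(tokens, levels))


if __name__ == "__main__":
    # Illustrative activations only; these values are not from the page.
    print(format_pairs(["And", " that", " is", " OK"], [0.0, 2.0, 1.0, 4.0]))
```

    Discretizing to a small integer range keeps the prompt compact and makes the relative strength of each token easy for the explainer model to compare.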
    Recent Explanations
    the special beginning-of-text marker indicating the start of a document or segment
    gpt-5
    <|begin_of_text|>test
    LLAMA3.1-8B
    18-LLAMASCOPE-RES-32K
    INDEX 12368
    markers of document starts and formatted headings/titles—especially quoted or all‑caps titles with adjacent punctuation like hyphens and periods.
    gpt-5
    <|begin_of_text|>test
    LLAMA3.1-8B
    18-LLAMASCOPE-RES-32K
    INDEX 3157
    the beginning of a text or document, as indicated by the start-of-text token.
    claude-4-5-sonnet
    <|begin_of_text|>Emma burned Mary's shirt
    LLAMA3.1-8B
    18-LLAMASCOPE-RES-32K
    INDEX 2618
    the beginning of documents and formal titles or honorifics (like "Mr.") preceding proper names.
    claude-4-5-sonnet
    <|begin_of_text|>test
    LLAMA3.1-8B
    18-LLAMASCOPE-RES-32K
    INDEX 3157
    commas in machine-generated or low-quality text.
    claude-4-5-sonnet
    with absolutely nothing but harm, charges, and no help
    Looking at the activations, this neuron activates strongly on: - Possessive constructions with apostrophes (Village Clerk**'s**, driver**'s**, owner**'s**) - References to specific locations/jurisdictions (Village of Hempstead
    claude-4-5-sonnet
    higher conference.↵↵It’s amazing how good of
    non-English text, particularly Greek and other European languages.
    claude-4-5-sonnet
    Σε κανένα στάδιο της διαδικασίας η εται
    narrative turning points and transitions where key events or actions occur in a story.
    claude-4-5-haiku
      Managing investment portfolios for individuals or institutions. (Requires
    The neuron is essentially inactive and does not respond to any token—it finds nothing.
    o4-mini
     in its pathogenesis, namely, serotonin, glutamate, nore
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 68877
    The neuron detects long runs of whitespace (indention) in code.
    o4-mini
     enumMapper() {↵        EnumMapper enumMapper =
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 75628
    The neuron lights up on general-purpose discourse or stance markers—common evaluative or filler words and phrases like “OK,” “fine,” “absolutely,” “that,” “it,” and “was.”
    o4-mini
    .↵↵And that is OK
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 1729
    The neuron fires on technical terms referring to optical or color‐spectral properties (e.g. spectral characteristics, color filters, wavelengths, colored light).
    o4-mini
     are generally provided with spectral filters for the three colors,
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 59871
    The neuron responds to comparative constructions (e.g. “as … as”, “more … than”) that set up an explicit comparison between two items.
    o4-mini
     to sickness are as important as schemes designed
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 47558
    The neuron fires on numeric literals (tokens composed of digits, with or without decimal point) such as integer or floating-point constants.
    o4-mini
     ContainerType<?> p_i50105
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 110582
    The neuron strongly activates on numeric quantity expressions—especially proportions or percentages (e.g. “half,” “two-thirds,” “%” and similar fraction/percentage tokens).
    o4-mini
     quit smoking and more than half make an attempt every year
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 20339
    The neuron is detecting numeric literals (numbers, including integers and decimals) in code.
    o4-mini
    1_, int p_i50105
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 61401
    The neuron fires on uncommon or domain‐specific tokens—mostly proper names and specialized technical terms.
    o4-mini
     only for clarity. The chromaprint_get_version
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 66108
    The neuron strongly activates on rare or specialized subword fragments—e.g. capitalized proper‐name pieces (Paul, Drag, Race), scientific or taxonomic roots and suffixes (arthro–, –pod, –phyletic, –gram, –dagram), and other low-frequency technical morphemes—so overall it’s looking for uncommon or technical subword units.
    o4-mini
     of this series of RuPaul's Drag Race UK
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 71183
    The neuron fires on sentence-initial coordinating conjunctions—especially “And” and “But” that start a new clause or paragraph.
    o4-mini
     so much energy.”↵↵And instead of apologising to
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 119915
    The neuron is triggered by React Router route‐configuration tokens (e.g. “Route,” “path,” “history,” “exact,” “Router,” “replace,” etc.).
    o4-mini
          <Route path="/" exact component={HomePage} />
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 105912
    GEMMA-3-27B
    16-GEMMASCOPE-2-RES-16K
    INDEX 174
    GEMMA-3-27B
    16-GEMMASCOPE-2-RES-16K
    INDEX 15232
    GEMMA-3-27B
    40-GEMMASCOPE-2-RES-16K
    INDEX 11837
    GEMMA-3-27B-IT
    41-GEMMASCOPE-2-RES-262K
    INDEX 299