Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APIAssistant AxisNEWCircuit TracerNEWSteerSAE EvalsExports Community BlogPrivacy & TermsContact
    © Neuronpedia 2026
    Privacy & TermsBlogGitHubSlackTwitterContact
    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
    Recent Explanations
    sentences that state technical explanations or factual/descriptive information (i.e., salient content words in expository sentences).
    gpt-5-mini
    divided into smaller chunks.↵2. Model Parallelism
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 105123
    It detects descriptions of a person’s clothing and physical appearance.
    gpt-5-mini
    ↵belly button; she also wore a pair of tight
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 70374
    The neuron detects numeric expressions—numbers, measurements, and decimal/percentage-like tokens.
    gpt-5-mini
    .0 (0.6--42.7
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 55267
    the neuron responds to content-bearing or topical words (important nouns, verbs, pronouns and discourse markers) rather than function or filler tokens.
    gpt-5-mini
     in response to the ever-changing demands of the modern
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 76062
    sentences or phrases expressing future claims, promises, or predictions (marked by modal/future constructions like "would," "going to," "will").
    gpt-5-mini
    17.↵↵“New roads and high roads”:
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 21511
    legal discussion of standards of review—phrases contrasting questions of law and fact (de novo, standard of review, jurisdiction, etc.).
    gpt-5-mini
     fact issue whatever is involved in reaching that determination. In
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 38277
    signals that a section heading or paragraph-level label (e.g., a titled or colon-ended section start) is beginning.
    gpt-5-mini
     study. Results and conclusions: The patients had upper respiratory
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 63269
    It detects numeric tokens and digit-heavy sequences (numbers, figure/section/table indices and other multi-digit numeric strings).
    gpt-5-mini
     IFN-γ, IL-10, and TNF
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 95160
    tokens carrying strong semantic content or topical importance (salient content words).
    gpt-5-mini
     unsure what is and isn't true and who,
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 1882
    mentions of "viruses" (references to viruses).
    gpt-5-mini
    32}↵================================↵↵Attempts to add the
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 20651
    the neuron highlights salient, information-dense tokens—important content words (main verbs, nouns, numbers) and emphatic punctuation that carry the core facts or claims.
    gpt-5-mini
     U.S. prisoners have been released from North Korea
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 47513
    This neuron detects the definite article "the" (the token " the", especially in phrases like "What is the ...").
    gpt-5-mini
    user↵<bos>What is the t'th term of
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 74100
    tokens that introduce or express an evaluative/opinionative stance (judgments, endorsements, or assessments).
    gpt-5-mini
     Luke Mula↵↵Okay<end_of_turn>↵
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 106969
    This neuron detects conversation turn boundaries — it activates on the end_of_turn token (end of a user turn).
    gpt-5-mini
     carry a graphics card ar<end_of_turn>↵
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 92499
    tokens that mark sentence boundaries or strong punctuation (sentence-ending periods, commas, quotes, exclamations and similar separators).
    gpt-5-mini
     ready for a second round.↵↵Rick got up and
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 113342
    sentences or phrases containing copular verbs (forms of "be") that state existence or identity (e.g., "is/are ...", "is not ...").
    gpt-5-mini
    . Daddy has to go to school and you know what
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 95050
    the neuron detects URL and web-domain fragments (parts of web addresses and links).
    gpt-5-mini
    ://www.independent.co.uk/life-
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 47200
    mentions of the speaker (first-person "I"/"I'm" and similar self-references).
    gpt-5-mini
     and three oak trees. So you can see that piling
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 112016
    sentences that ask questions, especially "how"‑style interrogative phrases (question words and the following verbs).
    gpt-5-mini
    :↵↵How do you equally space out elements in a
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 87003
    tokens that occur at the start of a sentence or turn (sentence/turn-initial words and markers).
    gpt-5-mini
     later Vincent came in."↵Mrs. Napier further testified
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 23548