Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APIAssistant AxisNEWCircuit TracerNEWSteerSAE EvalsExports Community BlogPrivacy & TermsContact
    © Neuronpedia 2026
    Privacy & TermsBlogGitHubSlackTwitterContact
    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
    Recent Explanations
    present tense verbs ending in "ing".
    gemini-2.5-flash-lite
    cds' template to insert the new cap into the file
    Neuronpedia logo
    GEMMA-3-270M-IT
    12-GEMMASCOPE-2-RES-65K
    INDEX 1359
    prominent content nouns—especially in Korean (and sometimes other non-English text)—that denote key entities, roles, or topics.
    gpt-5
    에서 가장 혁신적인 기업 중 하나로 평가받
    Neuronpedia logo
    GEMMA-3-27B-IT
    57-GEMMASCOPE-2-TRANSCODER-262K
    INDEX 229039
    This neuron detects section‐header tokens that introduce or label parts of a prompt (e.g. “CONTEXT,” “TASK,” “Extract,” “Read,” “text”).
    o4-mini
    the question from the given context only and give Not Found
    Neuronpedia logo
    GEMMA-3-27B-IT
    25-GEMMASCOPE-2-TRANSCODER-262K
    INDEX 130386
    This neuron detects text produced by the assistant (assistant-role turns / assistant's replies and self-referential or corrective utterances).
    gpt-5-mini
    refined.<|im_end|>↵<|im_start|>assistant↵You are correct,
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 48739
    Instances of a question opening in the "How do I ..." form (i.e., the interrogative phrase that asks for instructions).
    gpt-5-mini
    on selected option↵↵How do I change a button URL
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 86260
    snippets of HTML/JavaScript used in cross-site scripting or other client-side injection attacks (e.g., <script>, onerror/onclick attributes, src/import URLs, alert/document.cookie).
    gpt-5-mini
    ')</script><style>@import url('https://example
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 85078
    tokens that appear in headings, titles, links or other prominent document-level metadata (e.g., subject lines, URLs, proper‑names).
    gpt-5-mini
    Check: Winter Wheat Agriculture on an Ice Age Steppe
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 24107
    The neuron detects expressions of opinion or commentary—tokens that signal someone giving views, thoughts, or personal remarks.
    gpt-5-mini
    radio shows and offers his views on sports.
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 109969
    Tokens referring to juveniles/children or juvenile delinquency and related pediatric contexts.
    gpt-5-mini
    9, was declared delinquent by the Kansas State
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 110295
    the neuron detects capitalized headings, short uppercase acronyms, and other prominent named-entity tokens (e.g., CT, PWM, QR, NS, proper names).
    gpt-5-mini
    [0-2]↵↵Russia has fallen, bitcoin looks
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 42375
    Phrases where the speaker uses a first-person "I" to state desires, intentions or personal beliefs (e.g., "I want...", "I believe...", "I can...").
    gpt-5-mini
    18↵↵‘I Want To Work With Jose Mourinho
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 115214
    tokens that mark chat structure and role/metadata (system/user/assistant markers, start/end boundaries and other formatting/quote/punctuation markers).
    gpt-5-mini
    анию. <|im_end|>↵<|im_start|>assistant↵"Я -
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 121211
    UI/navigation elements and metadata on a webpage (e.g., "Image", "Share", "Page", sponsored/image credits).
    gpt-5-mini
    1 of 2↵↵Image 1 of 2
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 111083
    mentions of an IT support/technician helping a customer (visiting her home and assisting with computer problems).
    gpt-5-mini
    at her home to help her with her remote control problems
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 60036
    The neuron detects numeric and quantitative information—numbers, measurements, counts, percentages and other statistical or magnitude expressions.
    gpt-5-mini
    a regional research institute based out of Bodø, Norway
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 129032
    This neuron detects numbered list markers/section-step numerals that introduce ordered list items or steps.
    gpt-5-mini
    improve your performance engineering skills:↵↵1. Learn the basics
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 76484
    tokens that are keys/field names in structured data (e.g., config/json/metadata fields like "autoupdate" or "result").
    gpt-5-mini
    }, ↵ "autoupdate": { ↵ "
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 116480
    The neuron lights up on the token sequence for "complex" / "complexity" (and related morphology), i.e., words signaling complexity.
    gpt-5-mini
    naive to the “complexity” of the world’s
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 26158
    polite assistant closing/offers to help (phrases like “let me know if you have any questions”).
    gpt-5-mini
    Keyboard library.↵↵Let me know if you have any questions
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 39778
    Tokens that begin an assistant reply or dialogue turn—especially opening/confirmation phrases like "Of", "Of course," and the leading quote marks that start a response.
    gpt-5-mini
    serious.↵↵"Of course, NAME_2. What
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 115312