Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APIAssistant AxisNEWCircuit TracerNEWSteerSAE EvalsExports Community BlogPrivacy & TermsContact
    © Neuronpedia 2025
    Privacy & TermsBlogGitHubSlackTwitterContact
    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
    Recent Explanations
    This neuron detects numerical information—numbers and quantities (times, measurements, counts) in the text.
    gpt-5-mini
    the NAME_1<|eot_id|><|start_header_id|>assistant<|end_header_id|>↵↵It is
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 79869
    The neuron detects mentions of people/agents and reporting actions—tokens that mark participants, speakers, or verbs introducing reported speech or descriptions.
    gpt-5-mini
    range of departments and markets all speak about the 53
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 94435
    the presence of numeric or quantitative information (numbers, totals, measurements) or tokens in factual/data-heavy contexts.
    gpt-5-mini
    , giving him 12 total Igors.↵↵A few
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 14994
    tokens that appear at the start of a sentence or as speaker/answer labels (sentence-initial or speaker-label tokens).
    gpt-5-mini
    www.mayfairnewyork.com.↵↵Thanks for
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 11329
    utterances expressing the speaker's personal opinion, advice, or judgment.
    gpt-5-mini
    confusing you even though having a good connection.\n\n
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 37576
    Sentences that describe medical conditions or give factual explanations about their causes, severity, and treatments.
    gpt-5-mini
    undice is usually harmless and can be treated with phot
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 24773
    This neuron detects Cyrillic-script (Russian) tokens — i.e., Russian-language text.
    gpt-5-mini
    ильно" или "неправильно". П
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 120130
    phrases that express permission, ability, or the availability of a feature (words like "allow(s)", "ability", "allowing", etc.).
    gpt-5-mini
    was impressed by the trick of writing full cache lines being
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 59711
    phrasing that introduces a user instruction or request (e.g., "my first question is", "I want you to...", question/colon markers).
    gpt-5-mini
    my 1st question is: give me 5
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 74905
    special conversation/formatting tokens and metadata markers (role tags like "user"/"assistant" and header/end-of-text markers).
    gpt-5-mini
    copy?<|eot_id|><|start_header_id|>assistant<|end_header_id|>↵↵The person responsible for
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 30575
    Looks for special control/header tokens and conversation boundary markers (e.g., start/end/eot and speaker-header tokens).
    gpt-5-mini
    private boat skipper.<|eot_id|><|start_header_id|>assistant<|end_header_id|>↵↵Travel
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 112670
    tokens that occur in instruction/task-setting prompts (imperative or role directives), i.e., words used when the user tells the model what to do.
    gpt-5-mini
    of English. Assume that you possess the ability to craft
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 74050
    Tokens that occur in metadata or boilerplate parts of documents (timestamps, site/product info and similar routine/interface text).
    gpt-5-mini
    of purchase will apply to the purchase of this product.↵↵
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 41419
    The neuron detects special structural/control tokens and metadata boundaries (e.g., header/start/end markers and similar document-level tokens).
    gpt-5-mini
    al gust<|eot_id|><|start_header_id|>user<|end_header_id|>↵↵Scusa ma non
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 20611
    Tokens marking the assistant role or assistant message header (i.e., the "<|assistant|>"/assistant header indicator).
    gpt-5-mini
    آبی چیست؟<|eot_id|><|start_header_id|>assistant<|end_header_id|>↵↵The difference between
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 105948
    Named entities and specific proper nouns (people, places, organizations, product/model names and technical terms).
    gpt-5-mini
    Orao. It is in Serbian/Croatian
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 84075
    tokens that occur inside or adjacent to direct speech/quotation (dialogue).
    gpt-5-mini
    Wait,” I heard. “Wait.”↵↵“How long?”
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 114562
    tokens that mark the assistant/response header or conversation boundary (assistant role/header delimiter tokens).
    gpt-5-mini
    id.<|eot_id|><|start_header_id|>assistant<|end_header_id|>↵↵Here's an example
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 94832
    the neuron detects tokens that are parts of contractions (pieces containing an apostrophe).
    gpt-5-mini
    beach, we weren’t able to properly survey the area
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 10337
    sentences or phrases that give instructions, how-to steps, or action-oriented guidance.
    gpt-5-mini
    , would love to be able to quickly post across news
    Neuronpedia logo
    LLAMA3.1-8B-IT
    15-RESID-POST-AA
    INDEX 91276