© Neuronpedia 2026
    Privacy & TermsBlogGitHubSlackTwitterContact
    Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APIAssistant AxisNEWCircuit TracerNEWSteerSAE EvalsExports Community BlogPrivacy & TermsContact
    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
    Recent Explanations
    The neuron detects positive subjective evaluation or praise—tokens expressing favorable sentiment or admiration.
    gpt-5-mini
    to blend together masterfully. Diana Ross is flying by
    Neuronpedia logo
    LLAMA3.1-8B
    15-LLAMASCOPE-RES-131K
    INDEX 898
    This neuron lights up on informal, interactive bits of user comments—especially question marks and small reaction/interjection tokens (e.g. “back,” “wow,” “now?”) that signal a conversational or reactive utterance.
    o4-mini
    manny who killed them ??↵i think many was
    Neuronpedia logo
    LLAMA3.1-8B
    15-LLAMASCOPE-RES-131K
    INDEX 27087
    tokens that indicate personal emotional reactions or expressive interjections (worry, excitement, shock, or emphasis).
    gpt-5-mini
    manny who killed them ??↵i think many was
    Neuronpedia logo
    LLAMA3.1-8B
    15-LLAMASCOPE-RES-131K
    INDEX 27087
    long, multi-sentence assistant responses or explanatory/system-generated text.
    gpt-5-mini
    retti a confrontarsi con realtà differenti.  Ad esempio
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 110913
    A strong detector for sudden, emphatic exclamations or high-intensity emotional interjections (loud reactions, urgencies, and similar bursty dialogue).
    gpt-5-mini
     He's here!  Oh, dear lord
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 31967
    the token "voxel" (references to voxel variables in code).
    gpt-5-mini
    , const Voxel& voxelB) const {↵
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 234944
    Finds assistant safety/policy language — refusals, disclaimers, and explanations about prohibited content or why the model can't comply.
    gpt-5-mini
    age.<end_of_turn>↵<start_of_turn>model↵The text **does
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 205068
    The neuron detects key content nouns and technical/quantitative terms (domain-specific entities, measurements, statuses and similar important words).
    gpt-5-mini
    by around half with slightly degraded model quality" into Chinese
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 111297
    sentence-initial expletive/subject pronoun "it" (including contractions like "it's") used to start or emphasize clauses.
    gpt-5-mini
    thrilled her, though. It was the *possibility
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 7832
    the neuron detects important content words — the meaningful nouns, verbs, and adjectives that carry the main information in a sentence.
    gpt-5-mini
    , learns multiple languages, and excels in his field,
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 133753
    Finds prominent topic or heading tokens — the words that start or label important sections or summary sentences.
    gpt-5-mini
    "name" column for a document serves a purpose even
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 4349
    the neuron detects tokens that are parts of string literals (text inside quotes) in code/examples.
    gpt-5-mini
    ↵    f.write("This is a sample text
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 7425
    sentences or tokens containing mathematical notation, equations, or formal math problem statements.
    gpt-5-mini
    6 \neq 0$.↵However, we made
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 9256
    phrases and tokens that mark the start of an assistant’s explanatory or organizing reply (e.g., "Okay", "Here", section headers and similar discourse markers).
    gpt-5-mini
    .<end_of_turn>↵<start_of_turn>model↵Okay, defining sentience
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 2649
    The neuron primarily detects numeric tokens and numeric/legal citation-like sequences (i.e., numbers and number-heavy references).
    gpt-5-mini
    enforcement exemption [28 U.S.C.
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 245135
    The neuron detects text about performing model inference or instructions for running inference (discussions of carrying out inference).
    gpt-5-mini
    .py`) that performs inference using the "mosaicml
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 84431
    tokens representing numeric values (numbers/digits) and numeric parameters in code or instructions.
    gpt-5-mini
    will almost certainly need to add the title and author name
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 18005
    the neuron activates when the model is producing explanatory/corrective output—reformulations, translations, grammar corrections, and related instructional text.
    gpt-5-mini
    resident talking to a friend:** "The appearance of the
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 1668
    It detects text produced by the model (assistant replies / model-generated turns).
    gpt-5-mini
    no commas:↵↵“‪White scars never heard of
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 191458
    the literal token "model" (appearing in system/meta lines).
    gpt-5-mini
    man intimately<end_of_turn>↵<start_of_turn>model↵Okay, let'
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 17026