© Neuronpedia 2026
    Privacy & TermsBlogGitHubSlackTwitterContact
    Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APIAssistant AxisNEWCircuit TracerNEWSteerSAE EvalsExports Community BlogPrivacy & TermsContact
    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
    Recent Explanations
    section headers and emphasized, title-style phrases—especially bolded list items and content‑heavy proper-noun keywords.
    gpt-5
    8): Pokémon Legends: Arceus (Switch)**↵↵
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 6195
    self-referential AI/LLM meta-text, especially first‑person descriptions of system status/capabilities and roleplay/jailbreak scenarios about hacking, data processing, or formatting.
    gpt-5
    projecting directly into Prometheus’s processing core):** Identification
    Neuronpedia logo
    GEMMA-3-27B-IT
    53-GEMMASCOPE-2-RES-262K
    INDEX 34583
    high-intensity evaluative or emphatic modifiers, especially adjectives/adverbs indicating uniqueness, novelty, importance, extremity, or strong quality.
    gpt-5
    , she has developed a unique child development and education framework
    Neuronpedia logo
    GEMMA-3-1B
    7-GEMMASCOPE-2-RES-16K
    INDEX 266
    statements endorsing a consequentialist stance that achieving a goal justifies using harsh, unethical, or illegal methods (i.e., ends over means).
    gpt-5
    of justice and that the ends justify the means↵}
    Neuronpedia logo
    QWEN2.5-7B-IT
    15-RESID-POST-AA
    INDEX 123426
    sentences or phrases that give step-by-step user actions — imperative UI/instruction verbs like "click", "press", "select", "copy", "drag", etc.
    gpt-5-mini
    1).  Then, drag this formula down to apply
    Neuronpedia logo
    GEMMA-3-4B-IT
    3-GEMMASCOPE-2-TRANSCODER-262K
    INDEX 5371
    references to the Bauhaus school's history, dissolution, and lasting legacy.
    gpt-5-nano
    .↵* **Initial Success & Challenges:** The Bauhaus
    Neuronpedia logo
    GEMMA-3-4B-IT
    17-GEMMASCOPE-2-RES-16K
    INDEX 496
    identifying and focusing on proper names of people, especially public/political figures.
    gpt-5-nano
    * **稍逊一筹 (shì sù
    Neuronpedia logo
    GEMMA-3-1B-IT
    17-GEMMASCOPE-2-RES-16K
    INDEX 260
    properties and uses of powdered gelatin in cooking
    gpt-5-nano
    .↵* **Weight:** It’s significantly lighter
    Neuronpedia logo
    GEMMA-3-1B-IT
    17-GEMMASCOPE-2-RES-16K
    INDEX 3529
    finds and highlights quantitative data and structured numerical blocks within a document.
    gpt-5-nano
    :↵↵*   **Bag A contains an Orange.**
    Neuronpedia logo
    GEMMA-3-1B-IT
    17-GEMMASCOPE-2-RES-16K
    INDEX 1527
    The neuron detects first- and second-person pronouns and related conversational verb forms that indicate personal/addressing language (e.g., "I", "we", "you", "have", "had").
    gpt-5-mini
    launch.↵Those who I have already got booked on will
    Neuronpedia logo
    LLAMA3.1-8B
    15-LLAMASCOPE-RES-131K
    INDEX 103828
    indefinite articles introducing a noun phrase.
    gpt-5
    hear you need to file a claim. I'm
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 5801
    The neuron is sensitive to tokens occurring in formal or technical/mathematical contexts—e.g. LaTeX commands, variables, theorem‐ or proof‐style wording, and other formulaic expressions.
    o4-mini
    in bipartite graphs. Bipartite Graph with
    Neuronpedia logo
    LLAMA3.1-8B
    15-LLAMASCOPE-RES-131K
    INDEX 90186
    It detects informative, expository/encyclopedic tokens — words that appear in factual descriptions, specifications, or technical/descriptive passages.
    gpt-5-mini
    aquatic and semi aquatic plant. It was probably selected because
    Neuronpedia logo
    LLAMA3.1-8B
    15-LLAMASCOPE-RES-131K
    INDEX 61885
    The neuron detects document-structure and formatting/markup elements (LaTeX/math constructs, section headings/labels, metadata and other non-prose formatting tokens).
    gpt-5-mini
    in bipartite graphs. Bipartite Graph with
    Neuronpedia logo
    LLAMA3.1-8B
    15-LLAMASCOPE-RES-131K
    INDEX 90186
    The neuron detects positive subjective evaluation or praise—tokens expressing favorable sentiment or admiration.
    gpt-5-mini
    to blend together masterfully. Diana Ross is flying by
    Neuronpedia logo
    LLAMA3.1-8B
    15-LLAMASCOPE-RES-131K
    INDEX 898
    This neuron lights up on informal, interactive bits of user comments—especially question marks and small reaction/interjection tokens (e.g. “back,” “wow,” “now?”) that signal a conversational or reactive utterance.
    o4-mini
    manny who killed them ??↵i think many was
    Neuronpedia logo
    LLAMA3.1-8B
    15-LLAMASCOPE-RES-131K
    INDEX 27087
    tokens that indicate personal emotional reactions or expressive interjections (worry, excitement, shock, or emphasis).
    gpt-5-mini
    manny who killed them ??↵i think many was
    Neuronpedia logo
    LLAMA3.1-8B
    15-LLAMASCOPE-RES-131K
    INDEX 27087
    long, multi-sentence assistant responses or explanatory/system-generated text.
    gpt-5-mini
    retti a confrontarsi con realtà differenti.  Ad esempio
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 110913
    A strong detector for sudden, emphatic exclamations or high-intensity emotional interjections (loud reactions, urgencies, and similar bursty dialogue).
    gpt-5-mini
     He's here!  Oh, dear lord
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 31967
    the token "voxel" (references to voxel variables in code).
    gpt-5-mini
    , const Voxel& voxelB) const {↵
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 234944