© Neuronpedia 2026
    Privacy & TermsBlogGitHubSlackTwitterContact
    Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APIAssistant AxisNEWCircuit TracerNEWSteerSAE EvalsExports Community BlogPrivacy & TermsContact
    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
    Recent Explanations
    numeric list markers and words that introduce or structure enumerations.
    claude-4-5-sonnet
    a resource with the same or higher classification level. (
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 55
    words indicating positive evaluation, correctness, or satisfactory status.
    claude-4-5-sonnet
    regulatory licenses and is generally compliant in
    elements of formal news article or report formatting, including proper nouns, location names, attribution verbs, and structural markers.
    claude-4-5-sonnet
    ↵↵**HILLSDALE, CA -** Chaos erupted
    chain-of-thought reasoning and explicit step-by-step problem-solving.
    claude-4-5-haiku
    model↵Inner dialog: Okay, this is a simple
    situations of imminent danger or public emergencies—such as missing persons, crimes, or alerts—and guidance to contact law enforcement or take immediate safety action.
    gpt-5
    immediately if you believe someone is missing.** ↵↵There
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 12396
    statements expressing normative judgments, especially declaring something as unacceptable, incorrect, improper, or not allowed.
    gpt-5
     and one not countenanced by the Civil Rules.
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 99728
    important semantic content words that carry key meaning in a passage.
    claude-4-5-haiku
    start of their journey what trials lay in store, none
    common grammatical function words and articles like "a," "the," "to," "of," and "be."
    claude-4-5-sonnet
    States:**  This is the biggest economy in the world
    formatted text structure and section boundaries, particularly spaces, periods, and markers that divide or organize content into distinct parts.
    claude-4-5-sonnet
    Pole) (Difficulty: 1/5)**↵
    the beginning of the AI model's response turn or self-referential speech.
    claude-4-5-haiku
    ↵<start_of_turn>model↵Hi! I'm Gemma,
    words related to collective identity, belonging, and social groups.
    claude-4-5-sonnet
    !"↵*   "Our five-year plan:
    ideologically charged or controversial political and social viewpoints, particularly arguments from conservative, libertarian, or contrarian perspectives on contentious topics.
    claude-4-5-haiku
    " often state their goal is to celebrate traditional families and
    content-bearing words in formal analytical or expository writing, particularly nouns, verbs, and adjectives that carry substantive meaning in arguments, explanations, or descriptions.
    claude-4-5-sonnet
    factors also play a significant role.  Countries reliant on
    travel planning content with temporal markers and conditional logistics information.
    claude-4-5-haiku
    itinerary:**↵↵**Day 1: April 6
    content discussing serious challenges, obstacles, or complex problems that need to be addressed.
    claude-4-5-sonnet
    just technical hurdles; they require collaboration between governments, industry
    substantive professional or technical discourse with detailed expert-level information and strategic analysis.
    claude-4-5-haiku
    just technical hurdles; they require collaboration between governments, industry
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 1461
    contractions, particularly negative contractions such as "isn't," "won't," and "wasn't."
    claude-4-5-haiku
    use.  Padding isn't *super* plush
    AI safety refusal responses that explain why harmful or unethical requests violate guidelines.
    claude-4-5-sonnet
    ty," "submissive") is deeply objectifying and
    detailed, structured explanations that systematically break down complex topics into organized sections.
    claude-4-5-haiku
    . It's a complex topic, as Revelation uses
    electric vehicles and electromobility-related content, including EV models, specifications, charging infrastructure, and adoption trends.
    claude-4-5-haiku
    ↵    *   **Strengths:**  Supercharger
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 1746
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 53208
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 228512
    Neuronpedia logo
    GEMMA-3-27B
    53-GEMMASCOPE-2-RES-65K
    INDEX 17415
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-16K
    INDEX 206
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 393
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 3907
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 2506
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 17118
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 11039
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 802
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 1461
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 1482
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 2117
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 1350
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 12900