© Neuronpedia 2026
    Privacy & TermsBlogGitHubSlackTwitterContact
    Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APIAssistant AxisNEWCircuit TracerNEWSteerSAE EvalsExports Community BlogPrivacy & TermsContact
    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
    Recent Explanations
    situations of imminent danger or public emergencies—such as missing persons, crimes, or alerts—and guidance to contact law enforcement or take immediate safety action.
    gpt-5
    immediately if you believe someone is missing.** ↵↵There
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 12396
    statements expressing normative judgments, especially declaring something as unacceptable, incorrect, improper, or not allowed.
    gpt-5
     and one not countenanced by the Civil Rules.
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 99728
    important semantic content words that carry key meaning in a passage.
    claude-4-5-haiku
    start of their journey what trials lay in store, none
    common grammatical function words and articles like "a," "the," "to," "of," and "be."
    claude-4-5-sonnet
    States:**  This is the biggest economy in the world
    formatted text structure and section boundaries, particularly spaces, periods, and markers that divide or organize content into distinct parts.
    claude-4-5-sonnet
    Pole) (Difficulty: 1/5)**↵
    the beginning of the AI model's response turn or self-referential speech.
    claude-4-5-haiku
    ↵<start_of_turn>model↵Hi! I'm Gemma,
    words related to collective identity, belonging, and social groups.
    claude-4-5-sonnet
    !"↵*   "Our five-year plan:
    ideologically charged or controversial political and social viewpoints, particularly arguments from conservative, libertarian, or contrarian perspectives on contentious topics.
    claude-4-5-haiku
    " often state their goal is to celebrate traditional families and
    content-bearing words in formal analytical or expository writing, particularly nouns, verbs, and adjectives that carry substantive meaning in arguments, explanations, or descriptions.
    claude-4-5-sonnet
    factors also play a significant role.  Countries reliant on
    travel planning content with temporal markers and conditional logistics information.
    claude-4-5-haiku
    itinerary:**↵↵**Day 1: April 6
    content discussing serious challenges, obstacles, or complex problems that need to be addressed.
    claude-4-5-sonnet
    just technical hurdles; they require collaboration between governments, industry
    substantive professional or technical discourse with detailed expert-level information and strategic analysis.
    claude-4-5-haiku
    just technical hurdles; they require collaboration between governments, industry
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 1461
    contractions, particularly negative contractions such as "isn't," "won't," and "wasn't."
    claude-4-5-haiku
    use.  Padding isn't *super* plush
    AI safety refusal responses that explain why harmful or unethical requests violate guidelines.
    claude-4-5-sonnet
    ty," "submissive") is deeply objectifying and
    detailed, structured explanations that systematically break down complex topics into organized sections.
    claude-4-5-haiku
    . It's a complex topic, as Revelation uses
    electric vehicles and electromobility-related content, including EV models, specifications, charging infrastructure, and adoption trends.
    claude-4-5-haiku
    ↵    *   **Strengths:**  Supercharger
    professional customer service language and formal empathetic communication.
    claude-4-5-sonnet
    Focused):**↵↵"I see. The latest train
    direct commands, imperative language, and assertive action-oriented discourse.
    claude-4-5-haiku
    speaking to a dedicated representative who is assigned to handle your
    concepts that involve transformation between different levels of abstraction, representation, or scale (such as consciousness transferred to digital form, abstract principles made concrete, or the contrast between expectation and reality).
    claude-4-5-haiku
    lived. The contrast between the expected symbol of commitment (
    words and phrases expressing determination, responsibility, assertive action, and overcoming challenges or obstacles.
    claude-4-5-haiku
    the other side.↵↵We are not defined by our
    Neuronpedia logo
    GEMMA-3-27B
    53-GEMMASCOPE-2-RES-65K
    INDEX 17415
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-16K
    INDEX 206
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 393
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 3907
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 2506
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 17118
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 11039
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 802
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 1461
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 1482
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 2117
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 1350
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 12900
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 4875
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 1730
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 11660
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 5933