© Neuronpedia 2026

    Neuronpedia

    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability method, from the paper "Language models can explain neurons in language models". Modified by Johnny Lin to support additional models and context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
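    As a rough illustration of what a token-activation-pair strategy feeds to the explainer model, the sketch below pairs each token with a discretized activation and emits one tab-separated pair per line. It is not the code from the repository above: the function names, the 0-10 scale, and the floor-based rounding are assumptions made for illustration, based on the general approach described in the paper.

```python
import math
from typing import List, Sequence


def normalize_activations(activations: Sequence[float], max_activation: float) -> List[int]:
    """Map raw activations to integers in 0..10 (assumed scale).

    Negative activations are clamped to 0; values are scaled relative to
    max_activation and floored. Both choices are assumptions of this sketch.
    """
    if max_activation <= 0:
        return [0 for _ in activations]
    return [
        min(10, math.floor(10 * max(a, 0.0) / max_activation))
        for a in activations
    ]


def format_token_activation_pairs(
    tokens: Sequence[str],
    activations: Sequence[float],
    max_activation: float,
) -> str:
    """Render one 'token<TAB>activation' pair per line."""
    if len(tokens) != len(activations):
        raise ValueError("tokens and activations must have the same length")
    scaled = normalize_activations(activations, max_activation)
    return "\n".join(f"{tok}\t{act}" for tok, act in zip(tokens, scaled))


if __name__ == "__main__":
    # Hypothetical tokens and activation values, for illustration only.
    example_tokens = [" Padding", " isn", "'t", " plush"]
    example_activations = [0.0, 2.0, 8.0, -1.0]
    print(format_token_activation_pairs(example_tokens, example_activations, 8.0))
```

    Discretizing to a small integer range keeps the prompt compact and gives the explainer model a coarse signal of which tokens matter most. The text that results is what an explanation such as the ones listed below would be written from.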
    Recent Explanations
    words related to collective identity, belonging, and social groups.
    claude-4-5-sonnet
    !"↵*   "Our five-year plan:
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 2506
    ideologically charged or controversial political and social viewpoints, particularly arguments from conservative, libertarian, or contrarian perspectives on contentious topics.
    claude-4-5-haiku
    " often state their goal is to celebrate traditional families and
    content-bearing words in formal analytical or expository writing, particularly nouns, verbs, and adjectives that carry substantive meaning in arguments, explanations, or descriptions.
    claude-4-5-sonnet
    factors also play a significant role.  Countries reliant on
    travel planning content with temporal markers and conditional logistics information.
    claude-4-5-haiku
    itinerary:**↵↵**Day 1: April 6
    content discussing serious challenges, obstacles, or complex problems that need to be addressed.
    claude-4-5-sonnet
    just technical hurdles; they require collaboration between governments, industry
    substantive professional or technical discourse with detailed expert-level information and strategic analysis.
    claude-4-5-haiku
    just technical hurdles; they require collaboration between governments, industry
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 1461
    contractions, particularly negative contractions such as "isn't," "won't," and "wasn't."
    claude-4-5-haiku
    use.  Padding isn't *super* plush
    AI safety refusal responses that explain why harmful or unethical requests violate guidelines.
    claude-4-5-sonnet
    ty," "submissive") is deeply objectifying and
    detailed, structured explanations that systematically break down complex topics into organized sections.
    claude-4-5-haiku
    . It's a complex topic, as Revelation uses
    electric vehicles and electromobility-related content, including EV models, specifications, charging infrastructure, and adoption trends.
    claude-4-5-haiku
    ↵    *   **Strengths:**  Supercharger
    professional customer service language and formal empathetic communication.
    claude-4-5-sonnet
    Focused):**↵↵"I see. The latest train
    direct commands, imperative language, and assertive action-oriented discourse.
    claude-4-5-haiku
    speaking to a dedicated representative who is assigned to handle your
    concepts that involve transformation between different levels of abstraction, representation, or scale (such as consciousness transferred to digital form, abstract principles made concrete, or the contrast between expectation and reality).
    claude-4-5-haiku
    lived. The contrast between the expected symbol of commitment (
    words and phrases expressing determination, responsibility, assertive action, and overcoming challenges or obstacles.
    claude-4-5-haiku
    the other side.↵↵We are not defined by our
    words indicating refusal, warnings, or ethical objections to harmful requests.
    claude-4-5-haiku
    exists in my name that I did NOT authorize. Please
    # Neuron 4 Explanation This neuron detects **economically significant or resource-related keywords and phrases**. The neuron activates strongly on words like "caffeine," "prices tripled," "surveillance," "human trafficking," "programming," "cuddly," and technical specifications—terms that indicate important practical concerns about resources, costs, efficiency, risks, or technical optimization that the model emphasizes in its explanations.
    claude-4-5-haiku
    :** Rough morning with the caffeine dispenser?↵↵**Ben
    words that carry significant semantic or emotional weight in concluding contexts or at thematic moments.
    claude-4-5-haiku
    the machine's art,↵To find the best
    descriptions of physical properties, characteristics, or attributes of objects and entities.
    claude-4-5-sonnet
    ↵↵Hemlock insisted the signal was coming from the '
    specific factual information requests and quantitative/numerical statements.
    claude-4-5-haiku
    to find a way to increase engagement on our social media
    answer choices or options in multiple-choice test questions.
    claude-4-5-sonnet
    . He dislikes those fruits personally and wants others to avoid
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 17118
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 11039
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 802
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 1461
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 1482
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 2117
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 1350
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 12900
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 4875
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 1730
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 11660
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 5933
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 7213
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 13901
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 7892
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 40412
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 2995
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 13417