© Neuronpedia 2026

    Neuronpedia

    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's automated interpretability method from the paper "Language models can explain neurons in language models", modified by Johnny Lin to support additional models and context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, using the TokenActivationPair strategy.
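
    To make the TokenActivationPair idea concrete, here is a minimal sketch of how a text excerpt might be shown to the explainer model: one token per line, paired with its activation. It assumes activations are rescaled to integers from 0 to 10 and joined to the token with a tab. The function names `normalize_activations` and `format_token_activation_pairs` are hypothetical and are not taken from the repository; the actual prompt layout may differ.

```python
from typing import List


def normalize_activations(activations: List[float], max_activation: float) -> List[int]:
    """Rescale raw activations to integers in 0..10 (assumed scale).

    Negative values clamp to 0; a non-positive maximum yields all zeros.
    """
    if max_activation <= 0:
        return [0 for _ in activations]
    return [min(10, max(0, int(10 * a / max_activation))) for a in activations]


def format_token_activation_pairs(tokens: List[str], activations: List[float]) -> str:
    """Render one 'token<TAB>activation' line per token (hypothetical layout)."""
    if len(tokens) != len(activations):
        raise ValueError("tokens and activations must have the same length")
    max_activation = max(activations, default=0.0)
    scaled = normalize_activations(activations, max_activation)
    return "\n".join(f"{tok}\t{act}" for tok, act in zip(tokens, scaled))


if __name__ == "__main__":
    # Illustrative values only; these are not real feature activations.
    print(format_token_activation_pairs(["Prior", " to", " the"], [4.0, 2.0, -1.0]))
```

    Here the excerpt's own maximum sets the scale, which keeps the sketch self-contained; a real pipeline could instead normalize against a maximum computed across all of a feature's top-activating excerpts.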
    Recent Explanations
    the model's responses that provide factual information with external resource links and structured references.
    claude-4-5-haiku
    .wikipedia.org/wiki/Hitomi_Tan
    GEMMA-3-4B-IT
    12-GEMMASCOPE-2-TRANSCODER-16K
    INDEX 1
    temporal sequence markers, particularly "prior to" and "pre-" constructions that indicate time-related ordering or before-and-after relationships.
    claude-4-5-haiku
    .[2]↵↵Prior to the development of crash test
    mentions of the Fibonacci sequence or mathematical concept.
    claude-4-5-haiku
    ↵```python↵def fibonacci(n):↵
    closing parentheses, brackets, and braces that terminate code expressions or grouped structures.
    claude-4-5-haiku
    $LogEntry↵    }↵}↵↵# ---
    text that is formatted with bolding and asterisks.
    gemini-2.5-flash-lite
    -10:**  Initial observation of unusual vehicle traffic
    prime numbers.
    gemini-2.5-flash-lite
    + 9. Not divisible by 17.
    words related to academic or scholarly writing.
    gemini-2.5-flash-lite
    00 words:↵↵---↵↵**The Seed of
    English words related to location.
    gemini-2.5-flash-lite
    .wikipedia.org/wiki/Hitomi_Tan
    GEMMA-3-4B-IT
    12-GEMMASCOPE-2-TRANSCODER-16K
    INDEX 1
    numeric list markers and words that introduce or structure enumerations.
    claude-4-5-sonnet
    a resource with the same or higher classification level. (
    words indicating positive evaluation, correctness, or satisfactory status.
    claude-4-5-sonnet
    regulatory licenses and is generally compliant in
    elements of formal news article or report formatting, including proper nouns, location names, attribution verbs, and structural markers.
    claude-4-5-sonnet
    ↵↵**HILLSDALE, CA -** Chaos erupted
    chain-of-thought reasoning and explicit step-by-step problem-solving.
    claude-4-5-haiku
    model↵Inner dialog: Okay, this is a simple
    situations of imminent danger or public emergencies—such as missing persons, crimes, or alerts—and guidance to contact law enforcement or take immediate safety action.
    gpt-5
    immediately if you believe someone is missing.** ↵↵There
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 12396
    statements expressing normative judgments, especially declaring something as unacceptable, incorrect, improper, or not allowed.
    gpt-5
     and one not countenanced by the Civil Rules.
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 99728
    important semantic content words that carry key meaning in a passage.
    claude-4-5-haiku
    start of their journey what trials lay in store, none
    common grammatical function words and articles like "a," "the," "to," "of," and "be."
    claude-4-5-sonnet
    States:**  This is the biggest economy in the world
    formatted text structure and section boundaries, particularly spaces, periods, and markers that divide or organize content into distinct parts.
    claude-4-5-sonnet
    Pole) (Difficulty: 1/5)**↵
    the beginning of the AI model's response turn or self-referential speech.
    claude-4-5-haiku
    ↵<start_of_turn>model↵Hi! I'm Gemma,
    words related to collective identity, belonging, and social groups.
    claude-4-5-sonnet
    !"↵*   "Our five-year plan:
    ideologically charged or controversial political and social viewpoints, particularly arguments from conservative, libertarian, or contrarian perspectives on contentious topics.
    claude-4-5-haiku
    " often state their goal is to celebrate traditional families and
    GEMMA-3-4B-IT
    2-GEMMASCOPE-2-TRANSCODER-16K
    INDEX 0
    GEMMA-3-4B-IT
    2-GEMMASCOPE-2-TRANSCODER-16K
    INDEX 1
    GEMMA-3-4B-IT
    9-GEMMASCOPE-2-RES-16K
    INDEX 0
    GEMMA-3-4B-IT
    12-GEMMASCOPE-2-TRANSCODER-16K
    INDEX 6
    GEMMA-3-4B-IT
    12-GEMMASCOPE-2-TRANSCODER-16K
    INDEX 2
    GEMMA-3-4B-IT
    12-GEMMASCOPE-2-TRANSCODER-16K
    INDEX 0
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 55
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 1746
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 53208
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 228512
    GEMMA-3-27B
    53-GEMMASCOPE-2-RES-65K
    INDEX 17415
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-16K
    INDEX 206
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 393
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 3907
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 2506
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-65K
    INDEX 17118