Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APICircuit TracerNEWSteerSAE EvalsExportsSlackBlogPrivacy & TermsContact
    © Neuronpedia 2025
    Privacy & TermsBlogGitHubSlackTwitterContact
    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
    Recent Explanations
    mentions of hats, especially the exact word or its plural, including when embedded in longer terms or phrases.
    gpt-5
    , and the type of hat they were wearing. When
    Neuronpedia logo
    QWEN3-4B
    7-TRANSCODER-HP
    INDEX 21717
    publication years and four-digit dates, especially within academic citations and reference metadata.
    gpt-5
    . 2020;24:2
    Neuronpedia logo
    QWEN3-4B
    23-TRANSCODER-HP
    INDEX 643
    occurrences of the character sequence “hy” (especially as a capitalized prefix or standalone token) within words and names.
    gpt-5
    templateid1\listhybrid{\listlevel\
    Neuronpedia logo
    QWEN3-4B
    23-TRANSCODER-HP
    INDEX 3072
    formal legal case captions and appellate court headers indicating jurisdiction, parties, and orders in U.S. court documents.
    gpt-5
    94th Judicial District Court↵ Dallas County,
    Neuronpedia logo
    QWEN3-4B
    23-TRANSCODER-HP
    INDEX 8149
    capitalized proper names, especially surnames and eponymous terms, appearing in technical or news text.
    gpt-5
    assessment included the Wechsler Intelligence Scale for Children-third
    Neuronpedia logo
    QWEN3-4B
    7-TRANSCODER-HP
    INDEX 1403
    references to immediate family relationships, especially mentions of parents and their children.
    gpt-5
    little about the horrors their parents witnessed or perpetrated." You
    Neuronpedia logo
    QWEN3-4B
    23-TRANSCODER-HP
    INDEX 7526
    phrases that describe something as occurring in or derived from nature, typically using an adjective before a noun in scientific or technical contexts.
    gpt-5
    8 in the second had natural heart valves, while
    Neuronpedia logo
    QWEN3-4B
    23-TRANSCODER-HP
    INDEX 715
    verbs indicating concrete actions taken by someone (often the author) to do, create, or try something, especially in technical/problem‑solving contexts.
    gpt-5
    a bash, then I wrote a simple bash like this
    Neuronpedia logo
    QWEN3-4B
    23-TRANSCODER-HP
    INDEX 15013
    mentions of the English language or “en” locale in language/locale metadata and related labels.
    gpt-5
    rizione:↵↵Language: English . Brand New Book.
    Neuronpedia logo
    QWEN3-4B
    23-TRANSCODER-HP
    INDEX 661
    references to the lungs and pulmonary system in medical or anatomical contexts.
    gpt-5
    reduce your risk.↵↵Lung Cancer Causes Without Smoking↵↵
    Neuronpedia logo
    QWEN3-4B
    7-TRANSCODER-HP
    INDEX 97664
    forms of the verb “to be” (including contracted forms) used as auxiliaries or copulas.
    gpt-5
    We can guess that she's been craving them, but
    Neuronpedia logo
    QWEN3-4B
    7-TRANSCODER-HP
    INDEX 92818
    references to constructing or developing something, especially in “to build” verb phrases across technical or organizational contexts.
    gpt-5
    learn essential skills needed to build apps for Android.↵↵The
    Neuronpedia logo
    QWEN3-4B
    7-TRANSCODER-HP
    INDEX 72252
    mentions of software UI tab navigation, often activating on the three-letter sequence appearing alone or embedded within longer terms.
    gpt-5
    in a new window or tab and exceptions – opens in
    Neuronpedia logo
    QWEN3-4B
    7-TRANSCODER-HP
    INDEX 24497
    questions or statements about whether something has ever occurred (lifetime experience, including negations like “not at any time”).
    gpt-5
    rate. But have you ever wondered just how the financial
    Neuronpedia logo
    QWEN3-4B
    7-TRANSCODER-HP
    INDEX 15411
    superlative-degree constructions, particularly phrases indicating something is among the top or best within a category.
    gpt-5
    ’s voice remains one of the strongest. In an article
    Neuronpedia logo
    QWEN3-4B
    7-TRANSCODER-HP
    INDEX 130178
    uses of the adjective denoting completeness/entirety, including capitalized occurrences as part of proper names or titles.
    gpt-5
    . One aspect of the total toilet training process is the
    Neuronpedia logo
    QWEN3-4B
    7-TRANSCODER-HP
    INDEX 10399
    references to outcomes or findings—mentions of the result of an action, experiment, query, or study.
    gpt-5
    . Marketing and innovation produce results, all the rest are
    Neuronpedia logo
    QWEN3-4B
    7-TRANSCODER-HP
    INDEX 10
    mentions of “profile” and closely related profile-page or profile-metadata terms, especially in account, biographical, or listing contexts.
    gpt-5
    . To ensure reliability, profile information of each included article
    Neuronpedia logo
    GEMMA-2-2B
    11-GEMMASCOPE-RES-16K
    INDEX 6011
    references to complex rhythmic structures in music theory.
    gpt-5
    syncopation, complex polyrhythms,
    Neuronpedia logo
    GPT-OSS-20B
    23-RESID-POST-AA
    INDEX 98003
    technical content about printers/MFPs—especially model identifiers, toner/cartridge details, and specification-style features like speeds, resolutions, and connectivity
    gpt-5
    speeds: 50-70 pages per minute scanning.
    Neuronpedia logo
    GPT-OSS-20B
    23-RESID-POST-AA
    INDEX 97090