Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APICircuit TracerNEWSteerSAE EvalsBlogSlackPrivacy & TermsContact
    © Neuronpedia 2025
    Privacy & TermsBlogGitHubSlackTwitterContact
    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
    Recent Explanations
    This neuron detects sentence beginnings, particularly detecting "It is" or "This is" at the start of sentences and paragraphs.
    claude-3-7-sonnet-20250219
     to win the title.  Usually the Panamanian
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-16K
    INDEX 15754
    first-person pronouns and helping verbs (particularly "I" and auxiliary verbs like "would", "was", "need", "can").
    claude-3-5-haiku-20241022
    <start_of_turn>user↵<bos>↵I was worried about you said
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-16K
    INDEX 8394
    references to giving preference, priority, or favor to something or someone.
    gpt-4.1-2025-04-14
     the most effective, give preference to local firms, find
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 36473
    specific abbreviations or acronyms formatted with uppercase letters and numbers, often appearing in technical or legal contexts.
    deepseek-r1
     [@lecam; @LC2000];
    Neuronpedia logo
    GEMMA-2-2B
    0-GEMMASCOPE-RES-16K
    INDEX 7657
    technical acronyms and short codes, particularly those related to scientific and computing domains (like LC, LF, etc.).
    claude-3-5-haiku-20241022
     [@lecam; @LC2000];
    Neuronpedia logo
    GEMMA-2-2B
    0-GEMMASCOPE-RES-16K
    INDEX 7657
    text related to theater, opera, drama, and performing arts, especially in non-English languages.
    claude-3-7-sonnet-20250219
    , zarzuelas y óperas).↵↵Comp
    Neuronpedia logo
    LLAMA3.1-8B
    25-LLAMASCOPE-RES-32K
    INDEX 13895
    descriptions of furniture and home goods with their properties, materials, and quality features.
    claude-3-7-sonnet-20250219
    useful wherever it is required it might be in the kitchen
    Neuronpedia logo
    LLAMA3.1-8B
    25-LLAMASCOPE-RES-32K
    INDEX 17605
    The neuron detects the word "gate" in electronic and computing contexts.
    claude-3-7-sonnet-20250219
    .↵Here, a gate-to-source capacitance
    Neuronpedia logo
    GEMMA-2-2B
    2-GEMMASCOPE-TRANSCODER-16K
    INDEX 14269
    phrases related to political history, citizenship rights, and systemic injustices, particularly focusing on complex historical narratives about oppression, rights, and national struggles.
    claude-3-5-haiku-20241022
     rule and promised Congolese citizens a better future with greater
    Neuronpedia logo
    GEMMA-2-2B
    25-GEMMASCOPE-TRANSCODER-16K
    INDEX 2301
    strings and other programming language related terms.
    gemini-2.0-flash
     broken or inadequate. Such incomplete or broken connections have caused
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-16K
    INDEX 3448
    technical keywords, variables, codes, and structured identifiers, especially those related to programming, technical specifications, or data labels.
    gpt-4.1-2025-04-14
     broken or inadequate. Such incomplete or broken connections have caused
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-16K
    INDEX 3448
    words relating to Christianity and spiritual practice.
    gemini-2.0-flash
    , father, songwriter, worship leader and St. Louis
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-16K
    INDEX 16080
    references to Christian faith, church activities, and religious community.
    gpt-4.1-2025-04-14
    , father, songwriter, worship leader and St. Louis
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-16K
    INDEX 16080
    a mix of dates, times, years and code related terms.
    gemini-2.0-flash
    = 0x4↵<end_of_turn>↵
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-16K
    INDEX 5222
    technical jargon, codes, or identifiers frequently used in programming, system logs, error messages, and system or network operations.
    gpt-4.1-2025-04-14
    = 0x4↵<end_of_turn>↵
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-16K
    INDEX 5222
    capitalized text, particularly in titles, headings, and code/programming identifiers.
    claude-3-7-sonnet-20250219
     hold↵of VDRCONVERT.↵The README file
    Neuronpedia logo
    GEMMA-2-2B
    12-GEMMASCOPE-RES-16K
    INDEX 12716
    instances where information or content is made publicly available.
    gpt-4o
    of contributions to the fund publicly available.↵↵“Defendants
    Neuronpedia logo
    LLAMA3-8B-IT
    25-RES-JH
    INDEX 0
    This neuron detects the presence of a question—marking interrogative constructions (e.g. the “Q:” prompt, question‐word phrases, question marks, etc.).
    o4-mini
    <bos>Q: What is the capital of Kansas?
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-RES-16K
    INDEX 13413
    scientific publications, with a particular focus on research methodology, results, and experimental design within those publications.
    gemini-2.0-flash
     low EQE of the fluorescence device. Furthermore, as
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-16K
    INDEX 11654
    terms and phrases referring to categories, groups, types, classes, or roles that organize or distinguish entities, especially within technical or scientific contexts.
    gpt-4.1-2025-04-14
     low EQE of the fluorescence device. Furthermore, as
    Neuronpedia logo
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-16K
    INDEX 11654