© Neuronpedia 2026
    Privacy & TermsBlogGitHubSlackTwitterContact
    Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APIAssistant AxisNEWCircuit TracerNEWSteerSAE EvalsExports Community BlogPrivacy & TermsContact
    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
    Recent Explanations
    instructional verb-phrase constructions that direct someone to undertake an action or allocate time/care, especially in advice or step-by-step guidance contexts.
    gpt-5
    that a little alarming. Take your time pumpkin, no
    Neuronpedia logo
    GEMMA-3-1B
    13-GEMMASCOPE-2-RES-65K
    INDEX 1467
    action verbs denoting concrete actions or processes, especially phrasal/two‑word verbs and past or imperative forms.
    gpt-5
    back from a woman who closed her note with “t
    Neuronpedia logo
    GEMMA-3-1B
    13-GEMMASCOPE-2-RES-65K
    INDEX 589
    uses of the verb “make” (including its inflected forms and common collocations).
    gpt-5
    . They honor requests and make suggestions for best practices.
    Neuronpedia logo
    GEMMA-3-1B
    13-GEMMASCOPE-2-RES-65K
    INDEX 2333
    the neuron responds strongly to very common, high-probability tokens (frequent function words and common verbs).
    gpt-5-mini
    . They honor requests and make suggestions for best practices.
    Neuronpedia logo
    GEMMA-3-1B
    13-GEMMASCOPE-2-RES-65K
    INDEX 2333
    phrases related to branding and corporate presentation and client experience.
    gpt-5-nano
    . They honor requests and make suggestions for best practices.
    Neuronpedia logo
    GEMMA-3-1B
    13-GEMMASCOPE-2-RES-65K
    INDEX 2333
    array or list indexing operations with square brackets in code.
    claude-4-5-sonnet
                    int dTemp = n[j];↵                
    Neuronpedia logo
    GEMMA-2-9B
    20-GEMMASCOPE-RES-16K
    INDEX 15572
    underscore characters in code, particularly in variable names, constants, and identifiers.
    claude-4-5-sonnet
    format,↵    endpoints_output_format,↵
    Neuronpedia logo
    GEMMA-2-9B
    20-GEMMASCOPE-RES-16K
    INDEX 8551
    JavaScript object-oriented programming patterns, particularly object property definitions, method declarations, and constructor functions with `this` references.
    claude-4-5-sonnet
    defaults: {},↵↵ /**↵  * Register
    Neuronpedia logo
    GEMMA-2-9B
    20-GEMMASCOPE-RES-16K
    INDEX 2045
    line breaks or whitespace between sections in technical documentation or structured text.
    claude-4-5-sonnet
    9″ ‘Boston’ alloy wheels on GT. Also
    Neuronpedia logo
    GEMMA-2-9B
    20-GEMMASCOPE-RES-16K
    INDEX 14658
    proper names, particularly surnames, in bibliographic citations and text references.
    claude-4-5-sonnet
    cek Hillebrandt & Truran. 199
    Neuronpedia logo
    GEMMA-2-9B
    20-GEMMASCOPE-RES-16K
    INDEX 310
    the keyword "extends" in class declarations across various programming languages.
    claude-4-5-sonnet
    public class StatusEffectManager extends UntypedActor { ↵
    Neuronpedia logo
    GEMMA-2-9B
    20-GEMMASCOPE-RES-16K
    INDEX 712
    the beginning of a document (start token).
    claude-4-5-sonnet
     Pennsylvania<bos>/******************************************************************
    Neuronpedia logo
    GEMMA-2-9B
    20-GEMMASCOPE-RES-16K
    INDEX 15442
    mathematical and technical formatting markup, particularly LaTeX symbols, asterisks for emphasis, and special characters used in academic or technical documents.
    claude-4-5-sonnet
     a *Cartan scheme* if↵↵1.  
    Neuronpedia logo
    GEMMA-2-9B
    20-GEMMASCOPE-RES-16K
    INDEX 2337
    commas in lists or between clauses, particularly in formal or technical writing.
    claude-4-5-sonnet
    d expect while singing or speaking. And because it’
    Neuronpedia logo
    GEMMA-2-9B
    20-GEMMASCOPE-RES-16K
    INDEX 11381
    abbreviated variable names or constants in programming code, particularly those with underscores separating components.
    claude-4-5-sonnet
    ↵#define STA_PPSFREQ 0x0
    Neuronpedia logo
    GEMMA-2-9B
    20-GEMMASCOPE-RES-16K
    INDEX 4410
    the word "employee" or "employees" in workplace and employment-related contexts.
    claude-4-5-sonnet
     the employee to the particular danger causing
    Neuronpedia logo
    GEMMA-2-9B
    20-GEMMASCOPE-RES-16K
    INDEX 3675
    the pronoun "He" (or "he") when referring to a male person in the text.
    claude-4-5-sonnet
     truly interfaith couple. He and I need
    Neuronpedia logo
    GEMMA-2-9B
    20-GEMMASCOPE-RES-16K
    INDEX 10376
    phrases indicating negative assessments or concerning situations, particularly those involving criticism, problems, or undesirable circumstances.
    claude-4-5-sonnet
     long-term prognosis is alarming. As Reuters puts it
    Neuronpedia logo
    GEMMA-2-9B
    20-GEMMASCOPE-RES-16K
    INDEX 9091
    closing braces or brackets that end control flow blocks in code.
    claude-4-5-sonnet
    )↵ return false;↵↵#ifdef PB_
    Neuronpedia logo
    GEMMA-2-9B
    20-GEMMASCOPE-RES-16K
    INDEX 14275
    technical terminology related to mechanical systems, medical procedures, and scientific equipment.
    claude-4-5-sonnet
     tronic version of the DSG transaxle went into series production
    Neuronpedia logo
    GEMMA-2-9B
    20-GEMMASCOPE-RES-16K
    INDEX 9741