© Neuronpedia 2026

    Neuronpedia

    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
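    As a rough illustration of what a token/activation-pair strategy presents to the explainer model, the sketch below pairs each token with a discretized activation, one pair per line. It is a minimal sketch under stated assumptions, not the repository's code: the function names, the 0-10 integer scale, and the tab separator are all assumptions made for illustration.

```python
from typing import List, Sequence


def normalize_activations(activations: Sequence[float], max_activation: float) -> List[int]:
    """Scale raw activations to integers in 0..10 (assumed scale).

    Negative activations are clamped to 0 before scaling.
    """
    if max_activation <= 0:
        return [0 for _ in activations]
    return [min(10, int(10 * max(a, 0.0) / max_activation)) for a in activations]


def format_token_activation_pairs(
    tokens: Sequence[str],
    activations: Sequence[float],
    max_activation: float,
) -> str:
    """Render one "token<TAB>activation" pair per line (hypothetical format)."""
    if len(tokens) != len(activations):
        raise ValueError("tokens and activations must have the same length")
    scaled = normalize_activations(activations, max_activation)
    return "\n".join(f"{tok}\t{act}" for tok, act in zip(tokens, scaled))


if __name__ == "__main__":
    # Illustrative values only; not taken from any real feature.
    print(format_token_activation_pairs(["I", " think", " so"], [0.0, 4.0, 1.0], 4.0))
```

    Discretizing to a small integer range keeps the prompt compact and lets the explainer compare tokens at a glance; the exact scale and layout used by the actual prompts may differ.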
    Recent Explanations
    Detects the lexical token for thinking/cognition (the verb and its appearances in multiword phrases and compounds).
    gpt-5-mini
    doesn’t think so (pdf). He claims
    GPT2-SMALL
    3-ATT_32K-OAI
    INDEX 3786
    the neuron detects system/role/metadata and instruction-like tokens — i.e., blocks of system or assistant prompt text and formatting.
    gpt-5-mini
    Atwood weighting** to accommodate analysis.↵↵**(in
    GPT-OSS-20B
    23-RESID-POST-AA
    INDEX 6075
    The neuron detects tokens related to validation/verification (words about checking, validating, or verifying).
    gpt-5-mini
    4 | Сеттеры валидируют данные и выб
    GPT-OSS-20B
    19-RESID-POST-AA
    INDEX 102896
    It detects text addressing or referring to a classroom teacher (requests or messages directed to a teacher).
    gpt-5-mini
    height=12, autop=False):↵    
    GEMMA-3-270M-IT
    15-GEMMASCOPE-2-RES-262K
    INDEX 69844
    periods and punctuation marks that end structured entries or list items.
    claude-4-5-sonnet
    prior (or equal) to the live node in the
    GEMMA-3-27B
    53-GEMMASCOPE-2-RES-65K
    INDEX 7424
    text in non-English languages, particularly German, Spanish, Portuguese, and Russian.
    claude-4-5-sonnet
    com apetência para caminhar no fio da navalha
    GEMMA-3-27B
    53-GEMMASCOPE-2-RES-65K
    INDEX 2554
    periods ending sentences in non-English text.
    claude-4-5-sonnet
    oraz na terenie Niemiec. Grupa Kapitał
    GEMMA-3-27B
    53-GEMMASCOPE-2-RES-65K
    INDEX 1580
    text written in Slavic languages, particularly Serbian/Croatian/Bosnian.
    claude-4-5-sonnet
    broj turista i posetilaca dostize cifru od
    GEMMA-3-27B
    53-GEMMASCOPE-2-RES-65K
    INDEX 4907
    text in Cyrillic script, particularly Russian language content.
    claude-4-5-sonnet
    жестяных банках. ARCANOLLOAD
    GEMMA-3-27B
    53-GEMMASCOPE-2-RES-65K
    INDEX 8060
    words and phrases in non-English languages, particularly Russian and other Cyrillic text.
    claude-4-5-haiku
    жестяных банках. ARCANOLLOAD
    GEMMA-3-27B
    53-GEMMASCOPE-2-RES-65K
    INDEX 8060
    abstract plural count nouns that denote conceptual categories, attributes, or considerations in discourse.
    gpt-5
    reason that Easter has many symbols of new life:-
    GEMMA-3-1B
    13-GEMMASCOPE-2-RES-16K
    INDEX 407
    question marks that end mathematical problems or questions.
    claude-4-5-sonnet
     of 28?↵True↵Let x
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 5623
    The neuron strongly activates on multiword title‐cased phrases used in formal or marketing contexts (e.g. product names, corporate titles, branded jargon).
    o4-mini
    sloh specialized Customer Service Representatives will respond to your inquiry
    GEMMA-3-1B
    13-GEMMASCOPE-2-RES-16K
    INDEX 9107
    customer service and support related content.
    gpt-5-nano
    sloh specialized Customer Service Representatives will respond to your inquiry
    GEMMA-3-1B
    13-GEMMASCOPE-2-RES-16K
    INDEX 9107
    the neuron detects capitalized named entities and title-case noun phrases (proper names, product names, and formal job/role titles).
    gpt-5-mini
    sloh specialized Customer Service Representatives will respond to your inquiry
    GEMMA-3-1B
    13-GEMMASCOPE-2-RES-16K
    INDEX 9107
    phrases that convey superlatives or high-degree evaluations indicating exceptional prominence, importance, or extremity.
    gpt-5
    sloh specialized Customer Service Representatives will respond to your inquiry
    GEMMA-3-1B
    13-GEMMASCOPE-2-RES-16K
    INDEX 9107
    references to ASD and neurodevelopmental disorders and discussions of global versus local visuospatial processing.
    gpt-5-nano
    partecipanti). I partecipanti con ASD-NP hanno ottenuto sc
    GEMMA-3-1B
    22-GEMMASCOPE-2-RES-16K
    INDEX 10248
    The neuron strongly activates on Italian nouns (content words like subjects, objects, and entities).
    o4-mini
    partecipanti). I partecipanti con ASD-NP hanno ottenuto sc
    GEMMA-3-1B
    22-GEMMASCOPE-2-RES-16K
    INDEX 10248
    The neuron detects mentions of participant groups, clinical diagnoses, and study-related subject labels (e.g., ASD, NLD, ADHD, participants, group).
    gpt-5-mini
    partecipanti). I partecipanti con ASD-NP hanno ottenuto sc
    GEMMA-3-1B
    22-GEMMASCOPE-2-RES-16K
    INDEX 10248
    text written in Romance languages (especially Italian, Spanish, Portuguese, and French).
    gpt-5
    partecipanti). I partecipanti con ASD-NP hanno ottenuto sc
    GEMMA-3-1B
    22-GEMMASCOPE-2-RES-16K
    INDEX 10248