© Neuronpedia 2026

    Neuronpedia

    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability, from the paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models and context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
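    As a rough illustration of what a token/activation-pair prompt might look like, the sketch below rescales raw activations to integers from 0 to 10 and lists each token next to its score. The function names, the 0 to 10 scale, the tab separator, and the <start>/<end> markers are assumptions made for this example; the linked repository is the authoritative source for the actual prompt format.

    ```python
    import math
    from typing import List, Sequence


    def normalize_activations(
        activations: Sequence[float], max_activation: float, scale: int = 10
    ) -> List[int]:
        """Map raw activations to integers in [0, scale] (assumed scheme)."""
        if max_activation <= 0:
            return [0 for _ in activations]
        # Negative activations are clipped to zero before scaling.
        return [
            min(scale, math.floor(scale * max(0.0, a) / max_activation))
            for a in activations
        ]


    def format_token_activation_pairs(
        tokens: Sequence[str], activations: Sequence[float], max_activation: float
    ) -> str:
        """Render one token<TAB>score pair per line between start/end markers."""
        if len(tokens) != len(activations):
            raise ValueError("tokens and activations must have the same length")
        scores = normalize_activations(activations, max_activation)
        lines = [f"{tok}\t{score}" for tok, score in zip(tokens, scores)]
        return "\n".join(["<start>", *lines, "<end>"])


    # Hypothetical activation record; the numbers are made up for illustration.
    example = format_token_activation_pairs(
        [" carrying", " out", " a", " heist"], [0.0, 1.5, -1.0, 6.0], 6.0
    )
    print(example)
    ```

    An explainer model would then be shown several such records and asked to summarize what the high-scoring tokens have in common.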
    Recent Explanations
    foreign-language words and names, especially those with diacritics, non-Latin scripts, or hyphenated reduplication.
    gpt-5
    ↵↵Instrumen-instrumen tadi tidak memenuhi syarat Islam
    GEMMA-3-1B
    7-GEMMASCOPE-2-RES-16K
    INDEX 14322
    references to people or collective human groups, often in constructions with relative clauses referring to them.
    gpt-5
    this day and age, everyone is being watched whether it
    GEMMA-3-1B
    17-GEMMASCOPE-2-RES-16K
    INDEX 1967
    references to clandestine, mission-oriented actions—such as heists, assassinations, and tactical operations—covering planning, execution, and the agents involved.
    gpt-5
     carrying out that of a heist. Kiki can be seen
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 93143
    meta-discursive, emphatic framing in prose—generalized statements and reaction/argument setup using intensifiers, quantifiers, and function-word-heavy constructions.
    gpt-5
     which still seldom hit anyone. Swords and other bladed
    GEMMA-2-9B-IT
    20-GEMMASCOPE-RES-131K
    INDEX 84570
    The neuron primarily detects numeric tokens (digits, numerals and year-like numbers) in the text.
    gpt-5-mini
    https://cloud.google.com/dialogflow](
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 7114
    tokens involved in the model's self-introduction—first-person "I" + the contraction "m" and the assistant's name/identity.
    gpt-5-mini
    there! 👋 I'm Gemma, an open-
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 71337
    mentions of the model's identity, creators, and related branding/attribution tokens (e.g., Gemma, team, created, Google, parts of the AI URL).
    gpt-5-mini
    model created by the Gemma team at Google DeepMind.
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 9103
    promotional marketing language that includes calls to action or persuasive messaging.
    gpt-5-nano
     it is now closer.<bos>% Generated by roxygen
    GEMMA-2-2B
    18-GEMMASCOPE-RES-16K
    INDEX 6670
    Detects the lexical token for thinking/cognition (the verb and its appearances in multiword phrases and compounds).
    gpt-5-mini
    doesn’t think so (pdf). He claims
    GPT2-SMALL
    3-ATT_32K-OAI
    INDEX 3786
    the neuron detects system/role/metadata and instruction-like tokens — i.e., blocks of system or assistant prompt text and formatting.
    gpt-5-mini
    Atwood weighting** to accommodate analysis.↵↵**(in
    GPT-OSS-20B
    23-RESID-POST-AA
    INDEX 6075
    The neuron detects tokens related to validation/verification (words about checking, validating, or verifying).
    gpt-5-mini
    4 | Сеттеры валидируют данные и выб
    GPT-OSS-20B
    19-RESID-POST-AA
    INDEX 102896
    It detects text addressing or referring to a classroom teacher (requests or messages directed to a teacher).
    gpt-5-mini
    height=12, autop=False):↵    
    GEMMA-3-270M-IT
    15-GEMMASCOPE-2-RES-262K
    INDEX 69844
    periods and punctuation marks that end structured entries or list items.
    claude-4-5-sonnet
    prior (or equal) to the live node in the
    GEMMA-3-27B
    53-GEMMASCOPE-2-RES-65K
    INDEX 7424
    text in non-English languages, particularly German, Spanish, Portuguese, and Russian.
    claude-4-5-sonnet
    com apetência para caminhar no fio da navalha
    GEMMA-3-27B
    53-GEMMASCOPE-2-RES-65K
    INDEX 2554
    periods ending sentences in non-English text.
    claude-4-5-sonnet
    oraz na terenie Niemiec. Grupa Kapitał
    GEMMA-3-27B
    53-GEMMASCOPE-2-RES-65K
    INDEX 1580
    text written in Slavic languages, particularly Serbian/Croatian/Bosnian.
    claude-4-5-sonnet
    broj turista i posetilaca dostize cifru od
    GEMMA-3-27B
    53-GEMMASCOPE-2-RES-65K
    INDEX 4907
    text in Cyrillic script, particularly Russian language content.
    claude-4-5-sonnet
    жестяных банках. ARCANOLLOAD
    GEMMA-3-27B
    53-GEMMASCOPE-2-RES-65K
    INDEX 8060
    words and phrases in non-English languages, particularly Russian and other Cyrillic text.
    claude-4-5-haiku
    жестяных банках. ARCANOLLOAD
    GEMMA-3-27B
    53-GEMMASCOPE-2-RES-65K
    INDEX 8060
    abstract plural count nouns that denote conceptual categories, attributes, or considerations in discourse.
    gpt-5
    reason that Easter has many symbols of new life:-
    GEMMA-3-1B
    13-GEMMASCOPE-2-RES-16K
    INDEX 407
    question marks that end mathematical problems or questions.
    claude-4-5-sonnet
     of 28?↵True↵Let x
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 5623