Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APICircuit TracerNEWSteerSAE EvalsBlogSlackPrivacy & TermsContact
    © Neuronpedia 2025
    Privacy & TermsBlogGitHubSlackTwitterContact
    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
    Recent Explanations
    This neuron detects core “mind and consciousness” jargon—terms like mind, conscious/consciousness, “black box,” “hard problem,” etc., that flag philosophical discussion of the mind–body problem.
    o4-mini
    the typically metaphysical distinction between what is inside the box
    Neuronpedia logo
    LLAMA3.1-8B
    21-LLAMASCOPE-RES-32K
    INDEX 17571
    mentions of concepts related to consciousness, perception, and mind-body relation.
    gemini-2.0-flash
    the typically metaphysical distinction between what is inside the box
    Neuronpedia logo
    LLAMA3.1-8B
    21-LLAMASCOPE-RES-32K
    INDEX 17571
    concepts related to the mind, consciousness, and thought.
    gemini-1.5-pro
    the typically metaphysical distinction between what is inside the box
    Neuronpedia logo
    LLAMA3.1-8B
    21-LLAMASCOPE-RES-32K
    INDEX 17571
    words and phrases related to the mind, consciousness, and the nature of reality, often in a philosophical or scientific context.
    gemini-1.5-flash
    the typically metaphysical distinction between what is inside the box
    Neuronpedia logo
    LLAMA3.1-8B
    21-LLAMASCOPE-RES-32K
    INDEX 17571
    philosophical and scientific discussions about consciousness, mind, and perception.
    claude-3-5-haiku-20241022
    the typically metaphysical distinction between what is inside the box
    Neuronpedia logo
    LLAMA3.1-8B
    21-LLAMASCOPE-RES-32K
    INDEX 17571
    citation numbers and reference identifiers in academic papers.
    claude-3-7-sonnet-20250219
    3](#jah32587-bib-
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 14110
    references to MRI scans or medical imaging technology.
    claude-3-5-haiku-20241022
    The beauty of using the MRI scanner to administer the therapy
    Neuronpedia logo
    GPT2-SMALL
    0-RES-JB
    INDEX 2
    phrases related to enjoyment, experience, and sensory pleasure.
    claude-3-5-haiku-20241022
     microphone for the ultimate karaoke experience! With the HiFi
    Neuronpedia logo
    GEMMA-2-2B
    3-GEMMASCOPE-RES-16K
    INDEX 6409
    the name "Vic".
    gemini-2.0-flash
    alis.↵↵Watch Vicke Davis receive $2
    Neuronpedia logo
    GPT2-SMALL
    6-RES-JB
    INDEX 123
    expressions related to incrementing or modifying numerical values in code.
    gpt-4o
    totalLeftBoxesWidth += leftPaddingAddition;↵
    Neuronpedia logo
    GEMMA-2-2B
    12-GEMMASCOPE-RES-16K
    INDEX 123
    personal pronouns like "I", "me", "my", and personal storytelling language.
    claude-3-5-haiku-20241022
    , and I sewed a lot in high school, then
    Neuronpedia logo
    GEMMA-2-2B
    23-GEMMASCOPE-TRANSCODER-16K
    INDEX 12238
    terms related to legal and law enforcement contexts.
    gpt-4o-mini
     Regiment, whose Regimental Headquarters was at St Patrick'
    Neuronpedia logo
    GEMMA-2-2B
    4-GEMMASCOPE-TRANSCODER-16K
    INDEX 7671
    references to the term "Enix" from the name "Square Enix".
    gpt-4o
    Media Briefing↵Square Enix Media Briefing↵
    Neuronpedia logo
    LLAMA3.1-8B
    25-LLAMASCOPE-MLP-131K
    INDEX 72584
    short words or word segments, particularly those with numeric or symbolic characters.
    deepseek-v3
     of Light, making their mana regen come mostly from Illumination
    Neuronpedia logo
    GEMMA-2-2B
    19-GEMMASCOPE-TRANSCODER-16K
    INDEX 10165
    references to African countries, particularly Burkina Faso and Burundi. The neuron strongly activates on tokens like "Burkina", "Burundi", and similar country name fragments, while ignoring mentions of non-African nations. This suggests it specializes in detecting geographical references to specific African regions.
    deepseek-v3
    emptorists from Burkina Fasso; and a
    Neuronpedia logo
    LLAMA3.1-8B
    22-LLAMASCOPE-RES-131K
    INDEX 5743
    phrases related to specific names or titles, often emphasizing an organization's name or a unique phrase significant in context.
    gpt-4o
     year, the Veterans Giving Circle donated $2,7
    Neuronpedia logo
    GEMMA-2-2B
    12-GEMMASCOPE-RES-16K
    INDEX 12266
    elements related to Android layout attributes and settings.
    gpt-4o
    atesi, S. de Gironcoli, R
    Neuronpedia logo
    GEMMA-2-2B
    12-GEMMASCOPE-RES-16K
    INDEX 2084
    scientific terminology related to chemical and material sciences, with a focus on elements and compounds, specifically nanoparticles.
    gpt-4-turbo
    9 min and argon plasma for 3 min
    Neuronpedia logo
    GEMMA-2-2B
    12-GEMMASCOPE-RES-16K
    INDEX 643
    measurements and data related to experimental outcomes, particularly focusing on times, durations, and quantities.
    gpt-4o
    9 min and argon plasma for 3 min
    Neuronpedia logo
    GEMMA-2-2B
    12-GEMMASCOPE-RES-16K
    INDEX 643
    forms of the word "display" and related terms.
    deepseek-v3
    oscopic 3D image display device generally includes an optical
    Neuronpedia logo
    GEMMA-2-2B
    1-GEMMASCOPE-MLP-16K
    INDEX 8935