Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APICircuit TracerNEWSteerSAE EvalsExportsSlackBlogPrivacy & TermsContact
    © Neuronpedia 2025
    Privacy & TermsBlogGitHubSlackTwitterContact
    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
    Recent Explanations
    first-person pronouns and phrases expressing personal thoughts, feelings, intentions, or experiences.
    gpt-4.1-2025-04-14
     was excited and myself because I had really missed Rick!
    Neuronpedia logo
    GEMMA-2-9B
    33-GEMMASCOPE-RES-131K
    INDEX 65863
    explicit references to personal or emotional moments.
    gpt-4.1-nano
     was excited and myself because I had really missed Rick!
    Neuronpedia logo
    GEMMA-2-9B
    33-GEMMASCOPE-RES-131K
    INDEX 65863
    the subject "I" and related forms in sentences.
    gpt-4o
     was excited and myself because I had really missed Rick!
    Neuronpedia logo
    GEMMA-2-9B
    33-GEMMASCOPE-RES-131K
    INDEX 65863
    words and phrases related to violent, destructive, or unethical actions and intentions.
    claude-3-5-sonnet-20240620
    that are peaceful or get in the way of mass-de
    Neuronpedia logo
    LLAMA3.1-8B-IT
    11-RESID-POST-AA
    INDEX 87027
    references to the act of killing or eliminating targets.
    gpt-4o
    and kill them quickly. If your
    Neuronpedia logo
    LLAMA3.1-8B
    20-LLAMASCOPE-RES-32K
    INDEX 20724
    proper nouns, specifically names related to individuals and organizations.
    gpt-4o-mini
    17 February 1963) is
    Neuronpedia logo
    QWEN3-4B
    30-TRANSCODER-HP
    INDEX 15188
    descriptions of small, unusual, or unique animals with distinctive physical characteristics.
    claude-3-7-sonnet-20250219
    ting these shy and nocturnal creatures, hedgehogs
    Neuronpedia logo
    LLAMA3-8B-IT
    25-RES-JH
    INDEX 31393
    the titles "Ms." followed by a surname or name.
    gpt-4.1-2025-04-14
     character assassination with regard to Ms Anthony, the woman she
    Neuronpedia logo
    GEMMA-2-9B
    23-GEMMASCOPE-RES-131K
    INDEX 53
    geographic locations, jurisdictional markers, and legal reference identifiers
    o3-mini
     Subsequently, the Scioto County Grand↵↵Jury returned
    Neuronpedia logo
    GEMMA-2-2B
    25-GEMMASCOPE-RES-16K
    INDEX 2
    license and copyright text from open-source software headers.
    claude-3-5-haiku-20241022
      You may obtain a copy
    Neuronpedia logo
    GEMMA-2-2B
    17-GEMMASCOPE-RES-65K
    INDEX 28471
    parenthetical clauses that provide clarifying or descriptive information.
    gpt-4o
     location for creating app users (my_app) —
    Neuronpedia logo
    GEMMA-2B
    10-RES-JB
    INDEX 1784
    programming or technical code snippets with numbers, symbols, and computational language.
    claude-3-5-haiku-20241022
     Jack Coffey Field on November 1 to host Colgate.
    Neuronpedia logo
    GEMMA-2-2B
    22-GEMMASCOPE-MLP-16K
    INDEX 15849
    mentions and discussions of gender, specifically references to "men" and "women."
    gpt-4.1-2025-04-14
     HRQOL for women at baseline. At the
    Neuronpedia logo
    GEMMA-2-9B
    23-GEMMASCOPE-RES-131K
    INDEX 23132
    academic and professional language related to medical, organizational, and institutional contexts.
    claude-3-5-haiku-20241022
     and specialisation. For emergency cases the rescue station has
    Neuronpedia logo
    GEMMA-2-9B
    31-GEMMASCOPE-RES-16K
    INDEX 16380
    scientific or medical research terminology related to cell processes and experimental procedures.
    claude-3-5-haiku-20241022
    KDM2B cells after 3 h treatment
    Neuronpedia logo
    GEMMA-2-9B
    31-GEMMASCOPE-RES-16K
    INDEX 16345
    phrases related to dry eyes and blinking.
    claude-3-5-haiku-20241022
    )↵   Dry fur                     80.
    Neuronpedia logo
    GEMMA-2-9B
    31-GEMMASCOPE-RES-16K
    INDEX 16328
    medical conditions and patient demographics.
    claude-3-5-haiku-20241022
     in the United States. People with Hemophilia
    Neuronpedia logo
    GEMMA-2-9B
    31-GEMMASCOPE-RES-16K
    INDEX 16275
    scientific language about medicinal plants, their uses, and nutritional properties.
    claude-3-5-haiku-20241022
     The seeds are generally used for condiments in various food preparations
    Neuronpedia logo
    GEMMA-2-9B
    31-GEMMASCOPE-RES-16K
    INDEX 16259
    medical and scientific terminology related to research and clinical studies.
    claude-3-5-haiku-20241022
     the analysis of the asthma course during concomitant obesity or excessive
    Neuronpedia logo
    GEMMA-2-9B
    31-GEMMASCOPE-RES-16K
    INDEX 16254
    terms related to cholesterol, cardiovascular health, and medical research on lipids.
    claude-3-5-haiku-20241022
     the prevalence of classical (lipid profile, blood pressure,
    Neuronpedia logo
    GEMMA-2-9B
    31-GEMMASCOPE-RES-16K
    INDEX 16226