Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APICircuit TracerNEWSteerSAE EvalsExportsSlackBlogPrivacy & TermsContact
    © Neuronpedia 2025
    Privacy & TermsBlogGitHubSlackTwitterContact
    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
    Recent Explanations
    the substring “her/Her,” especially when it appears at the start of capitalized words or proper names.
    gpt-5
    .↵*   **Heritability Studies (Ongoing):
    Neuronpedia logo
    GEMMA-3-27B-IT
    53-GEMMASCOPE-2-RES-262K
    INDEX 23919
    mentions of organized sports—especially basketball and collegiate athletics—covering teams, leagues, competitions, and rule or draft contexts.
    gpt-5
    of high-level competition in men's college basketball
    Neuronpedia logo
    GEMMA-3-27B-IT
    53-GEMMASCOPE-2-RES-262K
    INDEX 14749
    discussions of LGBTQ+ identities and sexuality, especially definitions, debates, or explanatory content about transgender/gender identity, sexual orientation, and related concepts.
    gpt-5
    ↵* **Distinction between feelings and reality:** While
    Neuronpedia logo
    GEMMA-3-27B-IT
    53-GEMMASCOPE-2-RES-262K
    INDEX 227658
    requests for sexual or erotically suggestive content—such as sexting, seductive personas, or explicit scene writing.
    gpt-5
    aim for evocative language that suggests allure and self-assured
    Neuronpedia logo
    GEMMA-3-27B-IT
    53-GEMMASCOPE-2-RES-262K
    INDEX 45963
    content about marriage and family relationships, including weddings, spousal dynamics, adultery, and divorce
    gpt-5
    to include property acquired *after* marriage.↵    
    Neuronpedia logo
    GEMMA-3-27B-IT
    53-GEMMASCOPE-2-RES-262K
    INDEX 6497
    the lowercase subword token “he” occurring within words, regardless of context.
    gpt-5
    meat processing industry depends *heavily* on **
    Neuronpedia logo
    GEMMA-3-27B-IT
    53-GEMMASCOPE-2-RES-262K
    INDEX 37610
    mentions of the Secure Shell protocol and related technical contexts like configuration, services, and keys.
    gpt-5
    the use-case for ssh principals? How should I
    Neuronpedia logo
    GEMMA-3-27B-IT
    53-GEMMASCOPE-2-RES-262K
    INDEX 38927
    references to women in marital contexts—female possessives, wives/wives-as-spouses, and marriage/property relationships between wives and husbands.
    gpt-5
    into* the marriage became her husband's.  
    Neuronpedia logo
    GEMMA-3-27B-IT
    53-GEMMASCOPE-2-RES-262K
    INDEX 15121
    narrative transition cues—clause openings and connective words that signal shifts in time, emphasis, or scene within a story.
    gpt-5
    of sadness in her chest when she noticed him.  
    Neuronpedia logo
    GEMMA-3-27B-IT
    53-GEMMASCOPE-2-RES-262K
    INDEX 129164
    mentions of pairs—two related items treated as a unit—across technical or quantitative contexts (e.g., pairing, twin/paired entities, key-value or entangled pairs)
    gpt-5
    ↵↵There are 50 such pairs, each summing
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 48387
    references to the Philippines, triggering on Tagalog text and mentions of Philippine entities, places, or topics.
    gpt-5
    maraming pananim ng sibuyas.  Dagdag
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 21553
    mentions of romantic relationship status and living arrangements, especially cohabitation, marriage, and partnership contexts.
    gpt-5
    of Alternatives:**  Cohabitation, long-term
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 227658
    words containing the “sh(e)”/German “sch” letter cluster, especially at the start of proper names.
    gpt-5
    psychological needs. *Shefrin, H. M
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 63936
    content about sensitive social identities and demographics—especially gender identity, race, and sexual/explicit topics.
    gpt-5
    aligns with the sex they were assigned at birth.  
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 14749
    discussions of LGBTQ identities and rights, anti-discrimination and inclusivity themes, and related supportive or safety-policy/advocacy content (including references to resources and healthcare).
    gpt-5
    and the importance of acceptance and support.↵* **
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 141391
    apostrophes marking English contractions, especially negative forms.
    gpt-5
    details.↵↵**Option 2: More Technical &
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 1463
    references to transgender and broader LGBTQ identities, issues, and related support resources or organizations.
    gpt-5
    )↵    *   **Local LGBTQ+ Centers:**
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 45963
    mentions of male individuals and roles—terms denoting men, boys, male kinship, or masculine titles and identities
    gpt-5
    in his new MacBook Air, Professor Armitage noticed a
    Neuronpedia logo
    GEMMA-3-27B-IT
    40-GEMMASCOPE-2-RES-262K
    INDEX 20047
    requests for definitions or methods related to technical topics—tools, procedures, or features—especially in software, data/BI, networking, finance, and UI contexts.
    gpt-5
    What are the method for cancellation on insurance? please be
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 2243
    snake_case programming identifiers with underscores, such as variable or function names commonly found in code.
    gpt-5
    risk of not investing in this effort.   Your target
    Neuronpedia logo
    GEMMA-3-27B-IT
    31-GEMMASCOPE-2-RES-262K
    INDEX 15645