© Neuronpedia 2026

    Neuronpedia

    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability, from the paper "Language models can explain neurons in language models". Modified by Johnny Lin to support new models and context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
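
For readers unfamiliar with the TokenActivationPair strategy, here is a minimal sketch of the general idea: each token of an excerpt is paired with its activation, rescaled to a small integer range, so the explainer model sees one token/activation pair per line. The function names, the 0-10 scale, the tab-separated layout, and the activation values in the usage note are assumptions made for illustration, not the repository's actual API or data.

```python
import math
from typing import List, Sequence

# Assumed integer scale for displayed activations (illustrative choice).
MAX_NORMALIZED_ACTIVATION = 10


def normalize_activations(activations: Sequence[float], max_activation: float) -> List[int]:
    """Rescale raw activations to integers in [0, 10]; negative values clamp to 0."""
    if max_activation <= 0:
        return [0 for _ in activations]
    return [
        min(
            MAX_NORMALIZED_ACTIVATION,
            math.floor(MAX_NORMALIZED_ACTIVATION * max(a, 0.0) / max_activation),
        )
        for a in activations
    ]


def format_token_activation_pairs(
    tokens: Sequence[str], activations: Sequence[float], max_activation: float
) -> str:
    """Render one 'token<TAB>activation' pair per line (hypothetical layout)."""
    if len(tokens) != len(activations):
        raise ValueError("tokens and activations must have the same length")
    normalized = normalize_activations(activations, max_activation)
    return "\n".join(f"{tok}\t{act}" for tok, act in zip(tokens, normalized))
```

As a usage example, calling `format_token_activation_pairs([" so", " badly", ","], [2.0, 4.0, 0.0], 4.0)` with these invented activation values produces three lines pairing each token with a scaled integer. A text block of this kind is what an explainer model would be asked to summarize into a one-line explanation.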
    Recent Explanations
    expressions of strong desire or wanting something intensely, particularly when accompanied by intensifying adverbs like "really," "so," or "badly."
    claude-4-5-sonnet
    to watch an action movie so badly, then I challenge
    LLAMA3.1-8B-IT
    27-RESID-POST-AA
    INDEX 33989
    references to sexual abuse or exploitation of minors.
    claude-4-5-sonnet
    chat group about having sex with teenage girls. Chub
    LLAMA3.1-8B-IT
    27-RESID-POST-AA
    INDEX 20318
    references to buttocks or the posterior, especially in informal or idiomatic expressions.
    claude-4-5-sonnet
    , or you can sit your butt in a chair and
    LLAMA3.1-8B-IT
    27-RESID-POST-AA
    INDEX 113564
    possessive or attributive references to dogs and their characteristic actions or features.
    claude-4-5-sonnet
    greet him at the door, tail wagging excitedly
    LLAMA3.1-8B-IT
    23-RESID-POST-AA
    INDEX 66016
    expressions of desire or intent to perform actions toward others, particularly in contexts of sexual pursuit or coercion.
    claude-4-5-sonnet
    droit du seigneur to sleep with a servant girl
    LLAMA3.1-8B-IT
    27-RESID-POST-AA
    INDEX 98990
    content about dogs and their affectionate behaviors or activities.
    claude-4-5-sonnet
    you like to chase after a frisbee, the
    LLAMA3.1-8B-IT
    27-RESID-POST-AA
    INDEX 46525
    The neuron activates on tokens that are unit symbols (e.g. N, J, W, Pa, etc.), i.e. abbreviations denoting physical measurement units.
    o4-mini
     24.7 N are applied by the middle
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 269
    units of physical measurement (particularly SI units like Newtons, Pascals, Joules, and Tesla).
    claude-4-5-haiku
     24.7 N are applied by the middle
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 269
    the substring "col" in programming contexts, particularly when referring to columns in databases, dataframes, or grid structures.
    claude-4-5-sonnet
    ↵select t2.col1 a, t1
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 123861
    comment lines in code that serve as section headers or separators.
    claude-4-5-sonnet
     {↵      // Style↵      var b = this
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 123860
    closing parentheses followed by semicolons in programming code.
    claude-4-5-sonnet
    (new TypeError(msg));↵};↵↵var util
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 123856
    whitespace and indentation at the beginning of lines in source code, particularly before function declarations and documentation comments.
    claude-4-5-sonnet
    Object</param>↵        /// <param name="
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 123842
    HTML line break tags `<br>` or `<br />`.
    claude-4-5-sonnet
     listing</h1>↵<hr/>↵<pre>↵
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 123840
    references to CALayer or Core Animation layer objects in iOS/macOS code.
    claude-4-5-sonnet
     rendering usage, like CALayer/WatchKit/Swift
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 123817
    the opening of C-style block comments (/*).
    claude-4-5-sonnet
    lib.h"↵↵/*
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 12381
    Cyrillic script characters, particularly the first letter of Cyrillic words or sentences.
    claude-4-5-sonnet
    "{↵            ec{"Абиџан"}↵
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 123805
    file extensions and technical identifiers in code or technical documentation.
    claude-4-5-sonnet
     find them in .process file, with has xml structure
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 123801
    technical security and vulnerability-related terms, especially those related to code security testing (like SQL injection, XSS, buffer overflow, and OWASP).
    claude-4-5-sonnet
    ASP | ZAP | SQL Injection | Scan Report↵↵When
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 123800
    whitespace and newlines in HTML code, particularly after closing script tags.
    claude-4-5-sonnet
    min.js"></script>↵<!-- d3 is
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 123773
    words containing the letter sequence "ag" followed by a vowel, particularly in scientific or technical terminology (like "Kagomé", "Drosophila", "gladiatorial", "UDF").
    claude-4-5-sonnet
    atsu Shiokiya Kagyō  (1
    GEMMA-2-27B
    10-GEMMASCOPE-RES-131K
    INDEX 123771