Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APICircuit TracerNEWSteerSAE EvalsExportsSlackBlogPrivacy & TermsContact
    © Neuronpedia 2025
    Privacy & TermsBlogGitHubSlackTwitterContact
    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
    Recent Explanations
    words and pronouns referring to people, such as "child," "inmate," "she," and "they."
    gemini-2.5-flash
     our discipline measures. The child is starting to hide the
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 7615
    numerical expressions featuring the digit 5, especially round figures with trailing zeros or .5 values (percentages, currencies, measurements).
    gpt-5
    :10.5061/dryad
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 5147
    hyphenated compound words starting with "self-".
    deepseek-r1
    or specialty addictions treatment, self-monitoring)\↵*
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 7367
    copyright declarations and licensing headers in code or documentation files.
    deepseek-r1
    =============================================↵    Copyright (c) 2
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 11930
    numbers and numerical values embedded in text.
    deepseek-v3
    :10.5061/dryad
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 5147
    numerical values with decimal points or percentages.
    deepseek-r1
    :10.5061/dryad
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 5147
    The main thing this neuron does is find the digit "0" and, to a lesser extent, the digit "5" within numerical sequences.
    gemini-2.5-flash
    :10.5061/dryad
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 5147
    numbers in the 70s range, especially when they appear as standalone tokens or inside mathematical expressions and references.
    gpt-5
    33, 76 L.Ed.2
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 9726
    phrases that mark temporal progression, especially indicating events occurring after a prior event.
    gpt-5
     micro-optimization land after this point. You're
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 2554
    phrases introducing a temporal precondition—marking that once a prior condition is satisfied, the next action follows.
    gpt-5
    Musashimaru> Une fois activé, si
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 8820
    mentions of specialized proper nouns and technical terms, such as class/type names, scientific terminology, and named entities (places, people, organizations).
    gpt-5
     future performance.<bos>Q:↵↵Is there
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 5486
    verbs that express causing, enabling, or maintaining a change of state or outcome.
    gpt-5
    , and stagecraft to bring this character to life.
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 484
    references to “R/r”-led technical terms or symbols—such as scientific acronyms, mathematical variables, or notation-heavy tokens featuring R.
    gpt-5
     been formulated within the dynamical RPA so that the plasmons
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 14691
    uses of the definite article marking specific references, often before superlatives or key nouns
    gpt-5
     of the data may be the most natural way to distribute
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 4188
    phrases that express non-strict numerical comparisons and equality (e.g., “greater/less than or equal,” “at least,” “at most,” and “equal to”).
    gpt-5
    47 greater than or equal to 93/
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 9687
    mentions of rankings and enumerated lists, especially “top/first/second/third” itemizations with numbers, separators, and comparative ordering.
    gpt-5
     the nearest 20 are shared. Of these,
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 341
    multi-part proper names in formal/legal contexts, especially person or organization names with initials, degrees, or corporate designators and case-party listings.
    gpt-5
     Cecelia M. Beall, agree to continue this
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 5333
    text containing non-ASCII characters—such as accented letters, diacritics, encoded entities, or other foreign-language fragments.
    gpt-5
    ↵        can be divided vy 3.↵        
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 9428
    string parsing and manipulation operations in code, especially delimiter/regex-based tokenization and related function calls and symbols.
    gpt-5
    var parts = url.split("&");↵      var
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 13896
    explanatory statements asserting something isn’t needed because it’s already guaranteed or achieved, typically justified with “as/since” clauses.
    gpt-5
     non-distributed CFS versions as the distributed versions were designed
    Neuronpedia logo
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 14276