    Neuronpedia

    © Neuronpedia 2025
    EXPLANATION TYPE
    oai_token-act-pair
    Description
    OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
    Author
    OpenAI
    URL
    https://github.com/hijohnnylin/automated-interpretability
    Settings
    Default prompts from the main branch, strategy TokenActivationPair.
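
    The TokenActivationPair strategy named above presents each text excerpt to the explainer model as one token per line, paired with a discretized activation value. The following is a minimal sketch of that idea, not the repository's actual code: the function names, the 0 to 10 scale, and the tab separator are assumptions made for illustration, and the activation values in the usage example are invented.

```python
from typing import List, Sequence


def normalize_activations(
    activations: Sequence[float], max_activation: float, scale: int = 10
) -> List[int]:
    """Clip negatives to zero and map activations onto integers 0..scale.

    Hypothetical helper: the scale of 10 is an assumption for this sketch.
    """
    if max_activation <= 0:
        return [0 for _ in activations]
    # int() floors here because the value is non-negative after max(a, 0.0).
    return [
        min(scale, int(scale * max(a, 0.0) / max_activation))
        for a in activations
    ]


def format_token_activation_pairs(
    tokens: Sequence[str], activations: Sequence[float], max_activation: float
) -> str:
    """Render one "token<TAB>activation" line per token (assumed layout)."""
    if len(tokens) != len(activations):
        raise ValueError("tokens and activations must have the same length")
    normalized = normalize_activations(activations, max_activation)
    return "\n".join(f"{tok}\t{act}" for tok, act in zip(tokens, normalized))


if __name__ == "__main__":
    # Tokens taken from one of the excerpts on this page; activations are made up.
    print(format_token_activation_pairs([" We", " could", " sit"], [8.0, 4.0, -1.0], 8.0))
```

    In a layout like this, the explainer model reads the token/activation lines and proposes a short description of what the feature responds to, such as the entries listed under Recent Explanations.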
    Recent Explanations
    nonstandard text formatting or characters, especially Unicode whitespace/section breaks and mojibake from misencoded accented letters.
    gpt-5
    now Sunapee).  The name Goshen may
    GEMMA-2-9B
    21-GEMMASCOPE-RES-16K
    INDEX 9947
    references to a specific iteration or occasion in time—often contrasting with previous attempts or indicating plans to do it again or in the future.
    gpt-5
     petition to reopen. This time the offer was upped
    GEMMA-2-9B
    21-GEMMASCOPE-RES-16K
    INDEX 7713
    explanatory statements describing what software or code will do (future-tense descriptions of expected behavior or outcomes), often in instructional or comment-like contexts.
    gpt-5
     Disk Management. That brings up your drives. ↵
    GEMMA-2-9B
    21-GEMMASCOPE-RES-16K
    INDEX 2123
    highly technical, structured text—especially code, mathematical/formula notation, and procedural scientific descriptions with variables, measurements, and list-like formatting.
    gpt-5
    1 <- createCell(rows, colIndex
    GEMMA-2-9B
    21-GEMMASCOPE-RES-16K
    INDEX 15278
    LaTeX-style math notation, especially math-mode markers and superscript/exponent formatting.
    gpt-5
    2^{2}$, $1^{5}23
    GEMMA-2-9B
    21-GEMMASCOPE-RES-16K
    INDEX 5770
    uses of the first-person plural point of view (inclusive group references like we/us/our).
    gpt-5
     not even worth mentioning. We could sit here and list
    GEMMA-2-9B
    21-GEMMASCOPE-RES-16K
    INDEX 509
    commercial spam about online gambling (casinos/slots) and purchasing medications online, especially in promotional or “buy online” contexts.
    gpt-5
     After exposing all coins, blackjack surrender number of wilds presented
    GEMMA-2-9B
    21-GEMMASCOPE-RES-16K
    INDEX 12950
    formal scientific or technical prose that describes methods or theoretical settings, especially purpose-driven “to”-infinitive constructions, citations/parentheticals, and explanatory clauses common in academic writing.
    gpt-5
     some strict-aliasing issue?  Or maybe unexpected
    GEMMA-2-9B
    21-GEMMASCOPE-RES-16K
    INDEX 10251
    specialized, domain-specific nouns and jargon terms, particularly in technical or set-phrase collocations.
    gpt-5
     the ball, playing attacking attractive football, the players are
    GEMMA-2-9B
    21-GEMMASCOPE-RES-16K
    INDEX 15718
    code-like identifiers and configuration parameters, especially those with underscores, double-underscore suffixes, symbols (e.g., $) or version digits, often appearing in assignments and field/method declarations.
    gpt-5
     Config::SettingsWindowPosYID, Config::SettingsWindow
    GEMMA-2-9B
    21-GEMMASCOPE-RES-16K
    INDEX 12340
    the use of the `Path.GetExtension` method or similar file path manipulation methods in C#.
    gemini-2.5-flash
    thisFileExt = Path.GetExtension(file.Name);↵
    QWEN2.5-7B-IT
    23-RESID-POST-AA
    INDEX 1
    formal or technical nouns, particularly those referring to abstract concepts, objects, or categories in specialized or technical contexts.
    claude-4-5-sonnet
     if Vulkan is a good API? I've heard
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 6213
    formal or technical vocabulary, particularly medical terms and capitalized words that carry semantic importance.
    claude-4-5-haiku
     few minutes pass in this surreal repose until a hiker comes
    GEMMA-2-2B
    1-CLT-HP
    INDEX 45085
    mentions of specific vegetables, fruits, and agricultural produce.
    claude-4-5-haiku
    ; his new favorite vegetable is kohlrabi
    GEMMA-2-2B
    20-GEMMASCOPE-RES-16K
    INDEX 3333
    abstract nouns denoting states or qualities, especially those formed with suffixes like -ity, -ance/-ency, and some nominal -ing/-ure forms, with an additional spike at the beginning-of-sequence token.
    gpt-5
     Uncle joes son.<bos>Q:↵↵Looking for
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 14449
    occurrences of the lowercase definite article.
    gpt-5
     compute the matching score with the original detector/descriptor pairs
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 10831
    technical, domain-specific jargon and notation, especially in detailed mechanism descriptions, enumerated lists, code, or mathematical formulas.
    gpt-5
     is built into the cleat instead of the pedal.
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 46
    the start-of-sequence marker indicating the beginning of a text segment.
    gpt-5
    <bos>If we assume that x
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 10179
    nucleotide sequence notation—single-letter base codes and dinucleotide motifs commonly found in genomic data tables.
    gpt-5
    0.00                AC       0         0.
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 193
    Romance-language (French/Spanish/Portuguese) clause-introducing words—especially interrogative/relative conjunctions and pronouns that begin subordinate or question clauses.
    gpt-5
     est-ce normal ? Que puis-
    GEMMA-2-2B
    13-GEMMASCOPE-RES-16K
    INDEX 7102