Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APICircuit TracerNEWSteerSAE EvalsExportsBlogSlackPrivacy & TermsContact
    © Neuronpedia 2025
    Privacy & TermsBlogGitHubSlackTwitterContact
    EXPLANATION TYPE
    np_max-act-logits
    Description
    A Neuronpedia original that attempts to replicate Anthropic's autointerp used for their attribution graphs paper's features.
    Author
    Neuronpedia
    URL
    https://github.com/hijohnnylin/automated-interpretability/blob/4463a9fab7d4828bfd4c33194e64856b95377166/neuron_explainer/explanations/explainer.py#L811-L1135
    Settings
    Activations shown = 24 tokens around max act. Shows top 10 logits. Shows model the max activating token too.
    Recent Explanations
    inclusion
    claude-3-5-haiku-20241022
    , or they may be incorporated into separate bulletins. There
    Neuronpedia logo
    GEMMA-2-2B
    12-GEMMASCOPE-RES-32K
    INDEX 1802
    Time words
    gemini-2.0-flash
     people before we fly out tomorrow night. Our sojourn is
    Neuronpedia logo
    GEMMA-2-2B
    0-GEMMASCOPE-RES-16K
    INDEX 6
    adjust
    gpt-4o-mini
    "  container does get adjusted(i mean IE 
    Neuronpedia logo
    GEMMA-2-2B
    0-GEMMASCOPE-RES-16K
    INDEX 3
    con
    gemini-2.0-flash
     study population was non-consecutively included. No
    Neuronpedia logo
    GEMMA-2-2B
    0-GEMMASCOPE-RES-16K
    INDEX 1
    science/tech
    gpt-4.1-nano
     study population was non-consecutively included. No
    Neuronpedia logo
    GEMMA-2-2B
    0-GEMMASCOPE-RES-16K
    INDEX 1
    bleed
    gpt-4.1-nano
     of whom had continued to bleed following laparotomy for haem
    Neuronpedia logo
    GEMMA-2-2B
    0-GEMMASCOPE-RES-16K
    INDEX 0
    3 – TOP_POSITIVE_LOGITS mostly photography terms → photo captions
    o3
    .↵↵A man walks out of a branch of
    Neuronpedia logo
    GPT2-SMALL
    6-RES_POST_32K-OAI
    INDEX 4202
    nanotechnology
    gpt-4o-mini
    materials, energy storage materials and adsorbent materials.<bos>
    Neuronpedia logo
    GEMMA-2-2B
    25-GEMMASCOPE-TRANSCODER-16K
    INDEX 1662
    sounds
    gpt-4o-mini
     music sting and the unmistakable metallic-twang that constituted
    Neuronpedia logo
    GEMMA-2-2B
    21-GEMMASCOPE-TRANSCODER-16K
    INDEX 11060
    consciousness
    gemini-2.0-flash
    the typically metaphysical distinction between what is inside the box
    Neuronpedia logo
    LLAMA3.1-8B
    21-LLAMASCOPE-RES-32K
    INDEX 17571
    consciousness
    claude-3-5-haiku-20241022
    the typically metaphysical distinction between what is inside the box
    Neuronpedia logo
    LLAMA3.1-8B
    21-LLAMASCOPE-RES-32K
    INDEX 17571
    combination
    gpt-4o-mini
     generational, lifestage or a hybrid combination.↵↵Gener
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-RES-16K
    INDEX 40
    altogether
    claude-3-5-haiku-20241022
    as he skipped the vote altogether to attend a Christmas party
    Neuronpedia logo
    GPT2-SMALL
    6-RES-JB
    INDEX 23
    letter u
    claude-3-7-sonnet-20250219
    1]: *** [libuv.la] Error
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-RES-16K
    INDEX 36
    be
    gemini-2.0-flash
     and feathers.↵↵To be fair, wind turbines do
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-RES-16K
    INDEX 35
    gather
    gemini-2.0-flash
    seen as weak<|endoftext|>Students gather for lunch in the f
    Neuronpedia logo
    GPT2-SMALL
    6-RES-JB
    INDEX 125
    numbers and counts
    gemini-2.0-flash
     multiple titles in a season four times, including six titles
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 16372
    chat logs and code
    gemini-2.0-flash
    <Mobidoy> C'est quoi Etsy ?
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 16277
    Scientific references
    gemini-2.0-flash
     001195263.
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 16269
    research/citations
    gemini-2.0-flash
    005; @Petri2015
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 16317