Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APICircuit TracerNEWSteerSAE EvalsBlogSlackPrivacy & TermsContact
    © Neuronpedia 2025
    Privacy & TermsBlogGitHubSlackTwitterContact
    EXPLANATION TYPE
    np_max-act-logits
    Description
    A Neuronpedia original that attempts to replicate Anthropic's autointerp used for their attribution graphs paper's features.
    Author
    Neuronpedia
    URL
    https://github.com/hijohnnylin/automated-interpretability/blob/4463a9fab7d4828bfd4c33194e64856b95377166/neuron_explainer/explanations/explainer.py#L811-L1135
    Settings
    Activations shown = 24 tokens around max act. Shows top 10 logits. Shows model the max activating token too.
    Recent Explanations
    consciousness
    gemini-2.0-flash
    the typically metaphysical distinction between what is inside the box
    Neuronpedia logo
    LLAMA3.1-8B
    21-LLAMASCOPE-RES-32K
    INDEX 17571
    consciousness
    claude-3-5-haiku-20241022
    the typically metaphysical distinction between what is inside the box
    Neuronpedia logo
    LLAMA3.1-8B
    21-LLAMASCOPE-RES-32K
    INDEX 17571
    combination
    gpt-4o-mini
     generational, lifestage or a hybrid combination.↵↵Gener
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-RES-16K
    INDEX 40
    altogether
    claude-3-5-haiku-20241022
    as he skipped the vote altogether to attend a Christmas party
    Neuronpedia logo
    GPT2-SMALL
    6-RES-JB
    INDEX 23
    letter u
    claude-3-7-sonnet-20250219
    1]: *** [libuv.la] Error
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-RES-16K
    INDEX 36
    be
    gemini-2.0-flash
     and feathers.↵↵To be fair, wind turbines do
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-RES-16K
    INDEX 35
    gather
    gemini-2.0-flash
    seen as weak<|endoftext|>Students gather for lunch in the f
    Neuronpedia logo
    GPT2-SMALL
    6-RES-JB
    INDEX 125
    numbers and counts
    gemini-2.0-flash
     multiple titles in a season four times, including six titles
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 16372
    chat logs and code
    gemini-2.0-flash
    <Mobidoy> C'est quoi Etsy ?
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 16277
    Scientific references
    gemini-2.0-flash
     001195263.
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 16269
    research/citations
    gemini-2.0-flash
    005; @Petri2015
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 16317
    Proper nouns
    gemini-2.0-flash
    .  ↵↵     Darcy, defendant's
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 16370
    services for certain groups
    gemini-2.0-flash
     to serve as support for patients, families and hospital staff
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 16316
    HTML code
    gemini-2.0-flash
    link rel="stylesheet" href={{ Mix "/css/
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 16339
    medical research, health
    gemini-2.0-flash
    , yet certain child and family characteristics should be taken into
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 16371
    beginnings or introductions
    gemini-2.0-flash
    14. He scored on his debut↵↵Central F
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 16327
    "to be", "must be"
    gemini-2.0-flash
    . Things like that need to be considered. I do
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 16307
    use
    gemini-2.0-flash
     manuscripts, for↵permissions to reproduce them in any format
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 16291
    Citations
    gemini-2.0-flash
     vol. 89, no. 10
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 16369
    code
    gemini-2.0-flash
    ['genre_id']))->row()->name;↵
    Neuronpedia logo
    GEMMA-2-2B
    20-GEMMASCOPE-TRANSCODER-16K
    INDEX 16324