Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APICircuit TracerNEWSteerSAE EvalsExportsSlackBlogPrivacy & TermsContact
    © Neuronpedia 2025
    Privacy & TermsBlogGitHubSlackTwitterContact
    EXPLANATION TYPE
    np_max-act-logits
    Description
    A Neuronpedia original that attempts to replicate Anthropic's autointerp used for their attribution graphs paper's features.
    Author
    Neuronpedia
    URL
    https://github.com/hijohnnylin/automated-interpretability/blob/4463a9fab7d4828bfd4c33194e64856b95377166/neuron_explainer/explanations/explainer.py#L811-L1135
    Settings
    Activations shown = 24 tokens around max act. Shows top 10 logits. Shows model the max activating token too.
    Recent Explanations
    able
    gemini-2.0-flash
    75%.↵↵Indispensable in the information wars↵↵
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131057
    Software configuration files
    gemini-2.0-flash
    _TEST=m↵CONFIG_MODULE_UNLOAD=y↵CONFIG_SUS
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131063
    F
    gemini-2.0-flash
    ída, cenoura ralada, queijo
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131056
    char
    gemini-2.0-flash
    he so often treats uncharitably in his attack videos
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131066
    German language
    gemini-2.0-flash
    wie üblich flirtend begrüßt.<|im_end|>
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131028
    Pharmaceutical/chemical context
    gemini-2.0-flash
    the Synthetic Routes of Flubanilate 20
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131065
    cancer support
    gemini-2.0-flash
    : Support and resources for those facing the journey with a
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131069
    Codes and abbreviations
    gemini-2.0-flash
    esse daran haben, gucke ich gerne mal rein
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131039
    Code snippets
    gemini-2.0-flash
    heated rooms offer a flat-s..↵↵3 place de l
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131008
    Game files and directories
    gemini-2.0-flash
    into the Fallout 4 main folder and Data folder.
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131047
    Cython code
    gemini-2.0-flash
    , and then use the `cdivision` directive to
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131068
    Different languages
    gemini-2.0-flash
    ant in zwei Sätzen zusammenfassen?↵↵F
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131061
    sch
    gemini-2.0-flash
    , Ansible playbooks schrijven en uitvo
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131048
    Technical/scientific context
    gemini-2.0-flash
    and polyphagia (amplified hunger).
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131060
    Business acquisitions
    gemini-2.0-flash
    empresa, incluindo acionistas, clientes,
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131070
    code snippets
    gemini-2.0-flash
    ListDataCompare) (const void *data0,
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131021
    Code and Data
    gemini-2.0-flash
    from a drilling vessel or barge, an impending storm
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131071
    Formal language
    gemini-2.0-flash
    same, but make them more literary and improve my expression
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131013
    foreign languages
    gemini-2.0-flash
    bine. Sunt gata să încep
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131036
    say "urch" or "ur"
    gemini-2.0-flash
    you can find free Christian/church video or movie clips
    Neuronpedia logo
    QWEN2.5-7B-IT
    27-RESID-POST-AA
    INDEX 131052