
    Neuronpedia

    © Neuronpedia 2025
    EXPLANATION TYPE
    np_max-act
    Description
    A Neuronpedia original that forces concise explanations and shows the model the top activating tokens and texts. A simpler version of np_max-act-logits.
    Author
    Neuronpedia
    URL
    https://github.com/hijohnnylin/automated-interpretability/blob/917b11e38111c43526fe03ae6094a7081aeb982a/neuron_explainer/explanations/explainer.py#L1181
    Settings
    Activations shown: 24 tokens around the maximum activation. The model is also shown the max-activating token itself.
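    The setting above can be illustrated with a minimal sketch. It assumes "24 tokens around max act" means a window of 24 tokens in total, roughly centered on the highest-activating token; the page does not say whether 24 is the total or the count on each side. The names `window_around_max` and `window` are hypothetical and are not taken from the linked explainer.py.

```python
from typing import List, Tuple


def window_around_max(
    tokens: List[str],
    activations: List[float],
    window: int = 24,  # assumption: 24 is the total window size, not per side
) -> Tuple[List[str], str]:
    """Return up to `window` tokens around the max-activating token, plus that token."""
    if len(tokens) != len(activations):
        raise ValueError("tokens and activations must have the same length")
    if not tokens:
        return [], ""

    # Index of the token with the highest activation (first one on ties).
    max_idx = max(range(len(activations)), key=activations.__getitem__)

    # Center the window on the max token, then clamp it to the text bounds
    # so that a full window is returned whenever the text is long enough.
    start = max(0, max_idx - window // 2)
    end = min(len(tokens), start + window)
    start = max(0, end - window)

    return tokens[start:end], tokens[max_idx]


if __name__ == "__main__":
    toks = [f"t{i}" for i in range(100)]
    acts = [0.0] * 100
    acts[50] = 1.0
    shown, top = window_around_max(toks, acts)
    print(len(shown), top)
```

    Clamping the window, rather than truncating it at the text boundary, is a design choice made here so the explainer would always see the same amount of context; the real implementation may behave differently.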
    Recent Explanations
    or
    o4-mini
    Traffic Prioritization," or "Bandwidth Management."
    LLAMA3.1-8B-IT
    11-RESID-POST-AA
    INDEX 81285
    hyphen
    claude-3-5-haiku-20241022
    Magazine Ads** – Full‑color spread in *V
    GPT-OSS-20B
    15-RESID-POST-AA
    INDEX 21
    inspiration Method used: 2, Reason: Texts consistently discuss uplifting, forward-looking themes about human potential and meaning
    claude-3-5-haiku-20241022
    illuminate a path toward a more humane world. His work
    GPT-OSS-20B
    15-RESID-POST-AA
    INDEX 2
    the
    claude-3-5-haiku-20241022
    the basics—what’s the difference between a set and
    GPT-OSS-20B
    15-RESID-POST-AA
    INDEX 1
    legal terms
    claude-3-5-haiku-20241022
    KIND, either express or implied.↵# See
    QWEN3-4B
    27-TRANSCODER-HP
    INDEX 1198
    下
    gemini-2.0-flash
    安全属性的前提下显著提升处理器
    GPT-OSS-20B
    7-RESID-POST-AA
    INDEX 131025
    polenta
    gemini-2.0-flash
    ↵ - The **LM curve** is the set
    GPT-OSS-20B
    7-RESID-POST-AA
    INDEX 131006
    paradox of choice
    gemini-2.0-flash
    . “The paradox of choice” (Schwartz,
    GPT-OSS-20B
    7-RESID-POST-AA
    INDEX 130993
    AN
    gemini-2.0-flash
    F, SCAN } DiskScheduleAlgorithm
    GPT-OSS-20B
    7-RESID-POST-AA
    INDEX 131044
    cereal crops
    gemini-2.0-flash
    рожь, пшеницу, просо,
    GPT-OSS-20B
    7-RESID-POST-AA
    INDEX 131042
    Listicle titles
    gemini-2.0-flash
    more: 7 Surprising Health Facts About Coffee
    GPT-OSS-20B
    7-RESID-POST-AA
    INDEX 131039
    code, paths, commands
    gemini-2.0-flash
    libgtk-3.so.0` 等)不
    GPT-OSS-20B
    7-RESID-POST-AA
    INDEX 131036
    Night Watch painting
    gemini-2.0-flash
    the Painting 'The Night Watch' 1642 by
    GPT-OSS-20B
    7-RESID-POST-AA
    INDEX 131063
    Telomere
    gemini-2.0-flash
    ↵↵ - Telomere attrition↵↵ -
    GPT-OSS-20B
    7-RESID-POST-AA
    INDEX 131001
    cells, glycoproteins
    gemini-2.0-flash
    | **Glycoproteins (prostate‑
    GPT-OSS-20B
    7-RESID-POST-AA
    INDEX 131069
    situational
    gemini-2.0-flash
    than the characters. Situational irony occurs when a character
    GPT-OSS-20B
    7-RESID-POST-AA
    INDEX 130987
    Cryptocurrency
    gemini-2.0-flash
    you've read the complete Divergent trilogy by Veronica Roth (
    GPT-OSS-20B
    7-RESID-POST-AA
    INDEX 131022
    April
    gemini-2.0-flash
    **“Your Lie in April” (Shigatsu
    GPT-OSS-20B
    7-RESID-POST-AA
    INDEX 131026
    math equations
    gemini-2.0-flash
    }^{d-1} ζ^{-r j} (
    GPT-OSS-20B
    7-RESID-POST-AA
    INDEX 131046
    Computer science algorithms
    gemini-2.0-flash
    3. **多级反馈队列(Multi‑
    GPT-OSS-20B
    7-RESID-POST-AA
    INDEX 131031