    Neuronpedia

    © Neuronpedia 2025
    EXPLANATION TYPE
    np_acts-logits-general
    Description
    A Neuronpedia original that gives a short explanation, relying on the model's own intelligence rather than directing it through many instructions or examples. Ideal for more capable models such as gemini-2.0-flash or better.
    Author
    Neuronpedia
    URL
    https://github.com/hijohnnylin/automated-interpretability/blob/df609dbc46356fa25e6aaa4d48d4b23ba97284ed/neuron_explainer/explanations/explainer.py#L1187
    Settings
    Activations shown: 24 tokens around the max activation. Shows the top 10 logits. Also shows the model the max-activating token.
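
The settings above can be illustrated with a minimal Python sketch. It is not the code at the linked explainer.py; the function names `window_around_max` and `top_k_logits` are hypothetical, and it assumes that "24 tokens" means a total window roughly centered on the max-activating token (the page does not say whether 24 is the total or a per-side count).

```python
from typing import Dict, List, Sequence, Tuple


def window_around_max(
    tokens: Sequence[str],
    activations: Sequence[float],
    window: int = 24,  # assumption: total window size, not a per-side count
) -> Tuple[str, List[str]]:
    """Return the max-activating token and a window of tokens around it."""
    if not tokens or len(tokens) != len(activations):
        raise ValueError("tokens and activations must be non-empty and equal in length")
    # Index of the first token with the highest activation.
    peak = max(range(len(activations)), key=lambda i: activations[i])
    # Center the window on the peak, then clamp it to the sequence bounds.
    start = max(0, peak - window // 2)
    end = min(len(tokens), start + window)
    start = max(0, end - window)
    return tokens[peak], list(tokens[start:end])


def top_k_logits(logits: Dict[str, float], k: int = 10) -> List[Tuple[str, float]]:
    """Return the k (token, logit) pairs with the largest logit values."""
    return sorted(logits.items(), key=lambda kv: kv[1], reverse=True)[:k]
```

Under these assumptions, the explainer model would be shown the returned window, the max-activating token, and the top 10 entries from `top_k_logits`. The clamping step keeps the window at full size when the peak sits near the start or end of the text.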
    Recent Explanations
    Code or programming related text
    gemini-2.0-flash
    interface StoreGetter {↵    <T>(): T
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 15550
    Follows "same", "the", "according", or "isomorphic"
    gemini-2.0-flash
     visited = new HashSet<>();↵    Queue<Integer>
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 15689
    Followed by certain punctuation or "user"
    gemini-2.0-flash
     Meetings in collaboration with the most prestigious scientific institutions of our
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 16337
    Characters in mathematical formulas
    gemini-2.0-flash
    .↵↵February, 2012↵↵Ri
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 15607
    after "lifting," "using," or "the"
    gemini-2.0-flash
    , locomotives, ships, cranes, heavy trucks, earth
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 15941
    Follows "menu" or "<start_of_turn>" tokens
    gemini-2.0-flash
    DETECT_DEADLOCK));↵        detectDeadlockButton
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 15945
    Code, punctuation, or start of turn
    gemini-2.0-flash
                        SwingUtilities.invokeLater(new Runnable() {↵
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 15527
    Following "talk" or "<start_of_turn>user"
    gemini-2.0-flash
    <bos>Write a brief – sentence
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 15834
    German places or regions
    gemini-2.0-flash
     female students enrolled at Ulm/D. University in order
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 16262
    patience and tolerance
    gemini-2.0-flash
     did well and has the patience of a saint considering he
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 15530
    "Forward" or direction-related words
    gemini-2.0-flash
    .↵↵For example, forward-looking hackers could begin
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 15454
    Log-related terms and file paths
    gemini-2.0-flash
    (Logger::DEBUG));↵        $this->assertFalse
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 15597
    Mathematical/statistical deviation or tolerance
    gemini-2.0-flash
    28 cm with a possible error in measurement of at
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 16226
    "for" or start of turn followed by "user"
    gemini-2.0-flash
     by himself while the little one only concentrated on the cup
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 15372
    Superpowers or superhuman abilities
    gemini-2.0-flash
     in coding of purple. This only works on beings equal
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 15959
    Convolutional Neural Network (CNN) architectures
    gemini-2.0-flash
     on saliency maps learned by a Convolutional Neural Network
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 16227
    Code-related keywords and context
    gemini-2.0-flash
    ("WS")↵    public String ws;↵↵    
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 16208
    ending in "rol," "rel," "hel," "el," or "al"
    gemini-2.0-flash
     in a case involving Mariel Cubans. Id. at
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 16200
    Followed by indefinite pronouns (some, many)
    gemini-2.0-flash
     may interfere with your use of some of our sites or
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 15506
    "suspend" or "suspension"
    gemini-2.0-flash
    <start_of_turn>user↵<bos> from Susy and Geno!↵↵
    GEMMA-2-9B-IT
    31-GEMMASCOPE-RES-16K
    INDEX 16045