© Neuronpedia 2026
    Privacy & TermsBlogGitHubSlackTwitterContact
    Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    Natural Language
    Autoencoders
    NEW
    Assistant AxisNEWCircuit TracerUPDATESteerSAE EvalsExportsAPI Community BlogPrivacy & TermsContact
    1. Home
    2. Gemma-3-12B-IT
    3. 12-GEMMASCOPE-2-RES-16K
    4. 14523
    Prev
    Next
    INDEX
    Explanations

    **Analyze `TOP_ACTIVATING_TEXTS`**: * "changed to 'utilize SIAM'" * "**Utilize** Pantry Staples" * "changed to 'utilizes machine learning'" * "**Utility** Functions" * "**Utilizing** changed to 'powered by'" * "**Utilized** [Specific Inventory Software]" * "{type: **utility**, dollars: 150}" * "electric power isn't just a **utility**" These texts consistently use "utilize," "utilizes," "utilize," "Utility," "Utilized," and "utility." The context often involves actions (like using something, employing a function) or classifying something as a utility. The phrase "utilize" or "utility" is central.5. **Synthesize the pattern**: * The neuron is strongly activated by the word "utilize" and its variations ("utilizes", "utilizing", "utilized"). * It is also activated by the word "utility" and its variations. * The context suggests either the verb "to utilize" (meaning to make practical and effective use of) or the noun "utility" (referring to a service, usefulness, or a public service like electricity). * The `TOKENS_AFTER_MAX_ACTIVATING_TOKEN` hints at verb conjugations (`ize`, `izes`, `izing`, `ized`) and the noun "Functions". * The `TOP_POSITIVE_LOGITS` seem less relevant for this specific neuron's core function.6. **Formulate a concise explanation**: I need a phrase between 3 and 20 words. The core theme is "utilize" and "utility". Possible phrases: * "utilize or utility" (3 words) - A bit too simple. * "the act of utilizing or the concept of utility" (9 words) - Good, captures both meanings. * "using something effectively or a service" (7 words) - Explains the meaning. * "utilize and utility" (3 words) - Still too simple. * "verbs and nouns related to utilization or utility" (8 words) - A bit meta. * "how to use or the word utility" (7 words) - Focuses on meaning. * "utilize, utilizes, utility, utilized" (4 words) - Just listing. * "making effective use or usefulness" (5 words) - Captures semantic meaning. * "utilize or utility" seems the most direct and fits the "finding patterns in lists" directive. Let's re-examine the rules. * "Find patterns". The pattern is the word itself and its common forms/meanings. * "Concise explanation (3 to 20 words)". * "It could be a single word, or phrase, or pattern." * "Avoid simply listing all the tokens. Instead, try to find patterns." * "Just say the pattern itself". The simplest

    np_acts-logits-general · gemini-2.5-flash-lite
    New Auto-Interp
    Top Features by Cosine Similarity
    Configuration
    google/gemma-scope-2-12b-it/resid_post/layer_12_width_16k_l0_medium
    Prompts (Dashboard)
    238,145 prompts, 512 tokens each
    Dataset (Dashboard)
    lmsys + oasst1
    No Configuration Found
    Embeds
    IFrame
    Link
    Not in Any Lists

    No Comments

    Negative Logits
    ور
    3.41
    اد
    3.00
    ри
    2.25
    いる
    2.25
    رد
    2.25
    ியது
    2.25
    но
    2.20
    یه
    2.17
    ки
    2.16
    ur
    2.05
    POSITIVE LOGITS
    د
    2.34
    ر
    2.31
    ্স
    2.17
    tr
    2.11
    mm
    2.08
     ώστε
    1.98
    ter
    1.98
    ு
    1.96
    て
    1.94
    mitted
    1.90
    Activations Density 0.029%

    No Known Activations