Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APICircuit TracerNEWSteerSAE EvalsExportsSlackBlogPrivacy & TermsContact
    © Neuronpedia 2025
    Privacy & TermsBlogGitHubSlackTwitterContact
    1. Home
    2. Gemma-2-2B
    3. 20-GEMMASCOPE-TRANSCODER-16K
    4. 14022
    Prev
    Next
    INDEX
    Explanations

    first person statements of possibility, ability, or intent involving "I" and "can," "would," or "will."

    oai_token-act-pair · gemini-2.0-flash

    can

    np_max-act-logits · gemini-2.0-flash
    New Auto-Interp
    Top Features by Cosine Similarity
    Configuration
    google/gemma-scope/layer_20/width_16k
    Prompts (Dashboard)
    24,576 prompts, 128 tokens each
    Dataset (Dashboard)
    monology/pile-uncopyrighted
    No Configuration Found
    Embeds
    IFrame
    Link
    Not in Any Lists

    No Comments

    Negative Logits
    DockStyle
    -0.79
     hasn
    -0.69
     BoxFit
    -0.66
    extAlignment
    -0.66
     has
    -0.65
     المعيارى
    -0.65
    出版年
    -0.64
    Története
    -0.62
    igshid
    -0.61
    Personensuche
    -0.61
    POSITIVE LOGITS
    зулта
    0.60
     disambiguazione
    0.55
     also
    0.50
     easily
    0.48
     actually
    0.46
     well
    0.45
    RuleContext
    0.45
     grà
    0.44
     profit
    0.43
     capit
    0.43
    Activations Density 2.245%

    No Known Activations