
    Neuronpedia

    © Neuronpedia 2025
    Qwen3-4B · 30-TRANSCODER-HP · INDEX 15089
    Explanations

    what

    np_max-act-logits · gemini-2.0-flash
    Configuration
    Prompts (Dashboard)
    16,384 prompts, 128 tokens each
    Dataset (Dashboard)
    monology/pile-uncopyrighted

    Negative Logits
    " does"      -0.50
    "does"       -0.45
    " might"     -0.43
    " did"       -0.41
    " exactly"   -0.40
    " именно"    -0.39
    " Does"      -0.38
    "did"        -0.37
    " could"     -0.37
    "_does"      -0.36
    Positive Logits
    "erate"      0.28
    "opens"      0.26
    "eca"        0.26
    "也没什么"   0.25
    "亲"         0.25
    "rü"         0.24
    "_ALLOWED"   0.24
    "eden"       0.24
    "entious"    0.23
    "的到来"     0.23
    Activations Density 0.888%

    No Known Activations