Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APICircuit TracerNEWSteerSAE EvalsExportsSlackBlogPrivacy & TermsContact
    © Neuronpedia 2025
    Privacy & TermsBlogGitHubSlackTwitterContact
    1. Home
    2. Qwen3-4B
    3. 23-TRANSCODER-HP
    4. 162242
    Prev
    Next
    INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Top Features by Cosine Similarity
    Configuration
    Prompts (Dashboard)
    16,384 prompts, 128 tokens each
    Dataset (Dashboard)
    monology/pile-uncopyrighted
    No Configuration Found
    Embeds
    IFrame
    Link
    Not in Any Lists

    No Comments

    Negative Logits
    严
    -0.30
    乡
    -0.27
     Guill
    -0.27
     Saf
    -0.27
    é©°
    -0.26
     safely
    -0.26
    ä½Ļ
    -0.25
    æľīä¸ĢçĤ¹
    -0.24
     sacrific
    -0.24
     æŀ
    -0.24
    POSITIVE LOGITS
    ISK
    0.27
    SPA
    0.27
    erb
    0.26
    /MIT
    0.26
    estone
    0.25
    çĢį
    0.25
    å¥Ń
    0.24
    ople
    0.24
    dle
    0.24
    [arg
    0.24
    Activations Density 0.003%

    No Known Activations

    This feature has no known activations.