© Neuronpedia 2026
    Privacy & TermsBlogGitHubSlackTwitterContact
    Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    Natural Language
    Autoencoders
    NEW
    Assistant AxisNEWCircuit TracerUPDATESteerSAE EvalsExportsAPI Community BlogPrivacy & TermsContact
    1. Home
    2. Qwen3-1.7B
    3. 26-LLAMASCOPE-2-LORSA-16K-K64
    4. 626
    Prev
    Next
    INDEX
    Explanations

    say "Darwin"

    unknown · unknown
    New Auto-Interp
    Top Features by Cosine Similarity
    Embeds
    IFrame
    Link
    Not in Any Lists

    No Comments

    Negative Logits
    踹
    -17.88
     LM
    -17.63
     MF
    -16.88
     UIF
    -16.75
    HF
    -16.50
     HF
    -16.50
     OpCode
    -16.38
    澧
    -16.38
    店
    -16.25
     StyleSheet
    -16.13
    POSITIVE LOGITS
     Darwin
    48.25
    达尔
    38.25
    Dar
    35.75
    darwin
    34.00
     DAR
    28.00
    生物
    25.75
    恐龙
    24.13
     biologist
    24.00
     dinosaur
    23.88
     dar
    23.13
    Activations Density 0.053%

    No Known Activations