© Neuronpedia 2026

    Neuronpedia

    Model: Qwen3-1.7B
    Source: 27-LLAMASCOPE-2-LORSA-16K-K64
    Index: 15476
    Explanations

    Safety / security (original: 安全)


    Negative Logits
    Token      Logit
    "-st"      -62.00
    "(st"      -59.00
    "St"       -58.75
    "_st"      -58.75
    ".st"      -58.00
    "-St"      -56.00
    " St"      -55.75
    "\tst"     -55.25
    ".ST"      -54.75
    "st"       -54.25

    Positive Logits
    Token      Logit
    " hẹn"     14.56
    "lös"      14.19
    "睚"       13.69
    "渑"       13.63
    " Lotus"   13.56
    "莲花"     13.06
    "ɸ"        12.56
    "阱"       12.50
    "葵"       12.38
    "_VE"      12.25

    Activations Density: 5.416%

    No Known Activations