© Neuronpedia 2026

    Neuronpedia

    Model: Qwen3-1.7B
    Source: 27-LLAMASCOPE-2-LORSA-16K-K64
    Feature index: 15650
    Explanations

    say damage


    Negative Logits
    Token          Logit    Gloss
    " quo"         -16.88
    "国旗"         -16.25   national flag
    "-plugins"     -15.56
    " flags"       -15.50
    "dia"          -15.13
    " LOCK"        -15.00
    "lock"         -14.94
    "/packages"    -14.88
    "abbix"        -14.63
    "vpn"          -14.56
    Positive Logits
    Token          Logit    Gloss
    " damage"      36.00
    "损伤"         33.75    injury, damage
    "damage"       33.75
    " Damage"      32.25
    "伤害"         32.25    harm, injury
    "损害"         31.63    damage, harm
    "Damage"       31.25
    " damaged"     29.63
    " DAMAGE"      29.38
    "_damage"      28.88
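
The positive-logit list above can be checked against the "say damage" explanation with a small sketch. The token/logit pairs below are copied by hand from this page. The `normalize` helper and the grouping rule (strip whitespace and underscores, then lowercase) are illustrative assumptions, not part of Neuronpedia or its API.

```python
# Top positive-logit tokens copied from the list above as (token, logit) pairs.
# Leading spaces are part of the token text and are kept deliberately.
POSITIVE_LOGITS = [
    (" damage", 36.00),
    ("损伤", 33.75),
    ("damage", 33.75),
    (" Damage", 32.25),
    ("伤害", 32.25),
    ("损害", 31.63),
    ("Damage", 31.25),
    (" damaged", 29.63),
    (" DAMAGE", 29.38),
    ("_damage", 28.88),
]


def normalize(token: str) -> str:
    """Hypothetical helper: strip whitespace and underscores, then lowercase."""
    return token.strip().strip("_").lower()


def group_by_surface_form(entries):
    """Group (token, logit) pairs by their normalized surface form."""
    groups = {}
    for token, logit in entries:
        groups.setdefault(normalize(token), []).append((token, logit))
    return groups


groups = group_by_surface_form(POSITIVE_LOGITS)
for form, members in groups.items():
    top = max(logit for _, logit in members)
    print(f"{form!r}: {len(members)} token(s), max logit {top:.2f}")
```

Under this grouping rule, six of the ten tokens collapse to the single form "damage"; the remaining four are "damaged" and the three Chinese tokens, which all translate to damage or harm.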
    Activations Density 0.496%

    No Known Activations