© Neuronpedia 2026
    Privacy & TermsBlogGitHubSlackTwitterContact
    Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APIAssistant AxisNEWCircuit TracerNEWSteerSAE EvalsExports Community BlogPrivacy & TermsContact
    1. Home
    2. Qwen3-1.7B
    3. 27-LLAMASCOPE-2-LORSA-16K-K64
    4. 15871
    Prev
    Next
    INDEX
    Explanations

    say deception

    unknown · unknown
    New Auto-Interp
    Top Features by Cosine Similarity
    Embeds
    IFrame
    Link
    Not in Any Lists

    No Comments

    Negative Logits
    瑗
    -16.00
    сон
    -16.00
    囡
    -15.63
    班车
    -15.56
    首都
    -15.44
    UPPORT
    -15.38
    ussed
    -15.31
    网站地图
    -15.25
    煦
    -15.13
    OCUS
    -15.06
    POSITIVE LOGITS
    欺骗
    38.25
     fake
    36.25
    骗
    35.75
     deception
    35.00
    (fake
    33.50
    骗局
    33.25
     Fake
    33.25
    欺诈
    33.00
    fake
    32.75
    诈
    32.50
    Activations Density 0.546%

    No Known Activations