Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APICircuit TracerNEWSteerSAE EvalsExportsSlackBlogPrivacy & TermsContact
    © Neuronpedia 2025
    Privacy & TermsBlogGitHubSlackTwitterContact
    1. Home
    2. GPT2-Small
    3. Transcoders Residuals
    4. 8-TRES-DC
    5. 218
    Prev
    Next
    INDEX
    Explanations

    instances of the word "to" indicating actions or submissions

    oai_token-act-pair · gpt-4o-miniTriggered by @bot
    New Auto-Interp
    Top Features by Cosine Similarity
    Embeds
    IFrame
    Link
    Not in Any Lists

    No Comments

    Negative Logits
    ワン
    -0.96
    覚醒
    -0.86
     freely
    -0.77
    グ
    -0.75
     cheaply
    -0.74
    leeve
    -0.71
    bodied
    -0.69
    accessible
    -0.68
     furiously
    -0.67
    565
    -0.67
    POSITIVE LOGITS
     Michele
    0.75
     Polit
    0.73
     us
    0.71
     me
    0.71
     POLITICO
    0.70
     Manny
    0.70
     Danielle
    0.69
     Ralph
    0.68
     Northwestern
    0.68
     Herb
    0.68
    Activations Density 0.211%

    No Known Activations