Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APICircuit TracerNEWSteerSAE EvalsExportsSlackBlogPrivacy & TermsContact
    © Neuronpedia 2025
    Privacy & TermsBlogGitHubSlackTwitterContact
    1. Home
    2. Llama3.1-8B-IT (Instruct)
    llama3.1-8b-it
    Meta

    Releases

    Finding Misaligned Persona Features in Open-Weight Models
    September 2025
    Andy Arditi
    misaligned-persona

    Jump To

    Jump to Source/SAE
    Jump to Feature
    INDEX
    Random Feature

    Search Explanations

    Search via Inference

    Run Example Search

    Browse

    Features in LLAMA3.1-8B-IT@11-resid-post-aa
    1. Hover over a feature on the left to preview its details.
    2. Click a feature to lock it and interact with it.