© Neuronpedia 2026
    Privacy & TermsBlogGitHubSlackTwitterContact
    Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    Natural Language
    Autoencoders
    NEW
    Assistant AxisNEWCircuit TracerUPDATESteerSAE EvalsExportsAPI Community BlogPrivacy & TermsContact
    1. Home
    2. Llama3.1-8B-IT
    llama3.1-8b-it
    Meta

    Releases

    Finding Misaligned Persona Features in Open-Weight Models
    September 2025
    Andy Arditi
    misaligned-persona

    Attention Visualizer

    HeadVis (Luger, Kamath et al.)
    Find Head By Metric
    Metric & Number of Heads
    Top 8
    Click head to select
    Layer 2
    Layer 5
    Layer 10
    Layer 15
    Layer 16
    Select Head Manually
    Layer
    Head Index

    Activation Distribution & Metrics

    Q-K Distance Distribution

    Top Query Tokens

    Top Key Tokens

    Top Activating Sequences

    Jump To

    Jump to Source/SAE
    Jump to Feature
    INDEX
    Random Feature

    Search Explanations

    Search via Inference

    Run Example Search

    Browse

    Features in LLAMA3.1-8B-IT@11-resid-post-aa
    1. Hover over a feature on the left to preview its details.
    2. Click a feature to lock it and interact with it.