Neuronpedia
Get Started
API
Releases
Jump To
Search
Models
Circuit Tracer
NEW
Steer
SAE Evals
Exports
Slack
Blog
Privacy & Terms
Contact
Sign In
© Neuronpedia 2025
Privacy & Terms
Blog
GitHub
Slack
Twitter
Contact
Home
Models
Qwen2.5-7B-IT
qwen2.5-7b-it
Alibaba
Releases
Finding Misaligned Persona Features in Open-Weight Models
September 2025
Andy Arditi
misaligned-persona
Jump To
Jump to Source/SAE
MODEL
11-resid-post-aa
Source/SAE
Go
Jump to Feature
MODEL
Source/SAE
INDEX
Go
Random Feature
Random
Search Explanations
All
By Release
By Model
By Sources
MODEL
Show Dashboards
Hide Dashboards
Browse
MODEL
Resid Post - 131k
LAYER
Features in
QWEN2.5-7B-IT
@
11-resid-post-aa
Hover over a feature on the left to preview its details.
Click a feature to lock it and interact with it.