© Neuronpedia 2026
Neuronpedia
Andy Arditi · Finding Misaligned Persona Features in Open-Weight Models
llama3.1-8b-it · 23-resid-post-aa
Source from misaligned-persona · Resid Post - 131k · Layer 23
Configuration
andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_23/trainer_1
Features: 131,072
Data Type: float32
Hook Name: blocks.23.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted
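As a rough illustration of how the configuration values above relate to one another, the sketch below records them in a small Python structure and derives a few quantities from them. The class and function names are hypothetical and are not part of Neuronpedia or any SAE library. Two things are assumed rather than taken from this page: a residual stream width of 4096 for Llama 3.1 8B, and a "standard" architecture consisting of one encoder and one decoder matrix of shape d_model x n_features, with biases ignored in the size estimate.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class SourceConfig:
    """Values copied from the Configuration listing; field names are hypothetical."""
    path: str
    n_features: int
    dtype: str
    hook_name: str
    architecture: str
    context_size: int
    dataset: str


CONFIG = SourceConfig(
    path="andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_23/trainer_1",
    n_features=131_072,
    dtype="float32",
    hook_name="blocks.23.hook_resid_post",
    architecture="standard",
    context_size=1_024,
    dataset="monology/pile-uncopyrighted",
)

# Assumption: residual stream width of Llama 3.1 8B (not stated on this page).
D_MODEL = 4096
BYTES_PER_ELEMENT = {"float32": 4}


def layer_from_hook(hook_name: str) -> int:
    """Extract the layer index from a 'blocks.<n>.hook_resid_post' hook name."""
    match = re.fullmatch(r"blocks\.(\d+)\.hook_resid_post", hook_name)
    if match is None:
        raise ValueError(f"unexpected hook name: {hook_name!r}")
    return int(match.group(1))


def expansion_factor(cfg: SourceConfig, d_model: int = D_MODEL) -> float:
    """Ratio of dictionary size to the assumed residual stream width."""
    return cfg.n_features / d_model


def approx_weight_bytes(cfg: SourceConfig, d_model: int = D_MODEL) -> int:
    """Estimate weight storage, assuming one encoder and one decoder matrix
    of shape d_model x n_features each (biases ignored)."""
    return 2 * d_model * cfg.n_features * BYTES_PER_ELEMENT[cfg.dtype]


if __name__ == "__main__":
    print("layer:", layer_from_hook(CONFIG.hook_name))
    print("expansion factor:", expansion_factor(CONFIG))
    print("approx weight GiB:", approx_weight_bytes(CONFIG) / 1024**3)
```

Under these assumptions the hook name resolves to layer 23, matching the "Layer 23" label, and the 131,072 features correspond to a 32x expansion of the residual stream.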