© Neuronpedia 2026
Neuronpedia
Gemma-2-2B-IT
gemma-2-2b-it
Google DeepMind
Releases
AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders
July 2024
pyvene.ai, The Stanford NLP Group
axbench
Attention Visualizer: HeadVis (Luger, Kamath et al.)
Find Head By Metric (Induction Score, Prev Token Score, Attention Entropy, Self Attention), Top 8:
Layer 4: Head 4
Layer 6: Heads 1, 2, 3, 4
Layer 14: Head 0
Layer 15: Head 0
Layer 18: Head 6
Select Head Manually: Layer 0-25, Head Index 0-7
Features in GEMMA-2-2B-IT @ 20-axbench-reft-r1-res-16k