Neuronpedia
Vector Label: refusal (Arditi et al. 2024)
Model: gemma-2-2b-it
Layer #: 15
Steering Hook: blocks.15.hook_resid_pre
Steering Strength: 0.25
Uploader: bot-neuronpedia
Created At: 11/20/2024 9:49:19 AM
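The steering hook and strength listed above describe where a vector is added and how strongly. The page does not say how the strength is applied, so the following is only a minimal sketch of one common form of activation steering: add `strength * vector` to the residual-stream activations at every token position. The function name `steer_residual` and the 3-dimensional toy values are hypothetical; they are not the real refusal vector or Neuronpedia's implementation.

```python
from typing import List


def steer_residual(
    resid: List[List[float]], vector: List[float], strength: float
) -> List[List[float]]:
    """Return a copy of resid with strength * vector added at every position.

    resid is a [positions][d_model] list of activations (toy stand-in for the
    residual stream entering block 15); vector is a [d_model] steering vector.
    """
    if any(len(row) != len(vector) for row in resid):
        raise ValueError("each position must match the vector's dimension")
    return [[a + strength * v for a, v in zip(row, vector)] for row in resid]


# Toy example (assumed values, not the uploaded vector).
resid = [[1.0, 0.0, -1.0], [0.5, 0.5, 0.5]]
vector = [4.0, 0.0, -4.0]
steered = steer_residual(resid, vector, strength=0.25)
print(steered)
```

In a real model this addition would run inside a forward hook at the named hook point; whether the strength is further scaled or normalized is not stated on this page.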
Explanations
No Explanations Found
Top Features by Cosine Similarity
Embeds
IFrame
<iframe src="https://www.neuronpedia.org/gemma-2-2b-it/15-neuronpedia-resid-pre/0?embed=true&embedexplanation=true&embedplots=true&embedsteer=true&embedactivations=true&embedlink=true&embedtest=true" title="Neuronpedia" style="height: 300px; width: 540px;"></iframe>
Link
https://www.neuronpedia.org/gemma-2-2b-it/15-neuronpedia-resid-pre/0?embed=true&embedexplanation=true&embedplots=true&embedsteer=true&embedactivations=true&embedlink=true&embedtest=true
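The embed link above is a base path plus a set of `embed...=true` query parameters. As a rough sketch, the same URL can be assembled from flags. The helper `embed_url` and its keyword names are hypothetical, and the assumption that leaving a parameter out hides the matching panel is not confirmed by this page.

```python
from urllib.parse import urlencode

BASE = "https://www.neuronpedia.org"

# Panel names in the order they appear in the link on this page.
PANELS = ("explanation", "plots", "steer", "activations", "link", "test")


def embed_url(model: str, source: str, index: int, **panels: bool) -> str:
    """Build an embed URL; every panel defaults to enabled.

    Assumption: omitting a parameter disables that panel.
    """
    unknown = set(panels) - set(PANELS)
    if unknown:
        raise ValueError(f"unknown panels: {sorted(unknown)}")
    params = {"embed": "true"}
    for name in PANELS:
        if panels.get(name, True):
            params[f"embed{name}"] = "true"
    return f"{BASE}/{model}/{source}/{index}?{urlencode(params)}"


url = embed_url("gemma-2-2b-it", "15-neuronpedia-resid-pre", 0)
print(url)
```

With all panels left at their defaults, this reproduces the link shown above; passing, for example, `plots=False` drops only the `embedplots` parameter.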
No Known Activations