© Neuronpedia 2026
Privacy & Terms
Blog
GitHub
Slack
Twitter
Contact
Neuronpedia
Natural Language
Autoencoders
NEW
Assistant Axis
NEW
Circuit Tracer
UPDATE
Releases
Jump To
Search
Models
Steer
SAE Evals
Exports
Guides
API
Community
Blog
Privacy & Terms
Contact
Sign In
Home
Under Peer Review · Attention SAE Research Paper
GPT2-Small
Attention Out
9-ATT-KK
10461
Prev
Next
MODEL
9-att-kk
Source/SAE
INDEX
Go
Explanations
keywords related to programming and website components
oai_token-act-pair · gpt-4o-mini
Triggered by @bot
New Auto-Interp
AutoInterp Type
claude-4-5-haiku
Generate
Top Features by Cosine Similarity
Comparing With
GPT2-SMALL @ 9-att-kk
Configuration
ckkissane/attn-saes-gpt2-small-all-layers/gpt2-small_L9_Hcat_z_lr1.20e-03_l11.20e+00_ds24576_bs4096_dc1.00e-06_rsanthropic_rie25000_nr4_v9.pt
How To Load
Prompts (Dashboard)
36,864 prompts, 128 tokens each
Dataset (Dashboard)
Skylion007/openwebtext
Features
24,576
Data Type
float32
Hook Name
blocks.9.attn.hook_z
Hook Layer
9
Architecture
standard
Context Size
128
Dataset
Skylion007/openwebtext
Activation Function
relu
Show All
Embeds
Show Plots
Show Explanation
Show Activations
Show Test Field
Show Steer
Show Link
IFrame
<iframe src="https://www.neuronpedia.org/gpt2-small/9-att-kk/10461?embed=true&embedexplanation=true&embedplots=true&embedsteer=true&embedactivations=true&embedlink=true&embedtest=true" title="Neuronpedia" style="height: 300px; width: 540px;"></iframe>
Link
https://www.neuronpedia.org/gpt2-small/9-att-kk/10461?embed=true&embedexplanation=true&embedplots=true&embedsteer=true&embedactivations=true&embedlink=true&embedtest=true
Not in Any Lists
Add to List
▼
No Comments
ADD
Head Attr Weights
0:
0.06
1:
0.04
2:
0.02
3:
0.06
4:
0.06
5:
0.03
6:
0.22
7:
0.04
8:
0.08
9:
0.06
10:
0.07
11:
0.21
Negative Logits
—"
-3.48
…"
-3.41
…."
-3.38
Neander
-3.32
…]
-3.20
…"
-3.08
–
-3.07
film
-3.06
anski
-3.00
Zamb
-2.99
POSITIVE LOGITS
||
8.91
||
6.83
&&
5.09
)|
4.79
||||
4.58
quished
4.42
<<
4.41
===
4.38
otaur
4.27
:=
4.20
Act
ivations
Density 0.001%
Test
Steer
Stacked
Snippet
Full
Split DFA
Combine DFA
Show Raw Tokens
Show Formatted
Show Breaks
Hide Breaks
No Known Activations