© Neuronpedia 2026
Privacy & Terms
Blog
GitHub
Slack
Twitter
Contact
Neuronpedia
Natural Language
Autoencoders
NEW
Assistant Axis
NEW
Circuit Tracer
UPDATE
Releases
Jump To
Search
Models
Steer
SAE Evals
Exports
Guides
API
Community
Blog
Privacy & Terms
Contact
Sign In
Home
Google DeepMind · Exploring Gemma 2 with Gemma Scope
Gemma-2-9B
Residual Stream - 16k
28-GEMMASCOPE-RES-16K
8256
Prev
Next
MODEL
28-gemmascope-res-16k
Source/SAE
INDEX
Go
Explanations
text related to testimonies or statements in a narrative context
oai_token-act-pair · gpt-4o-mini
Triggered by @bot
No Scores
preceding pronouns referring to women
np_acts-logits-general · gemini-2.0-flash
No Scores
she or her pronouns
np_acts-logits-general · gemini-2.5-flash-lite
No Scores
New Auto-Interp
AutoInterp Type
claude-4-5-haiku
Generate
Top Features by Cosine Similarity
Comparing With
GEMMA-2-9B @ 28-gemmascope-res-16k
Configuration
google/gemma-scope-9b-pt-res/layer_28/width_16k/average_l0_119
How To Load
Prompts (Dashboard)
24,576 prompts, 128 tokens each
Dataset (Dashboard)
monology/pile-uncopyrighted
Features
16,384
Data Type
float32
Hook Name
blocks.28.hook_resid_post
Hook Layer
28
Architecture
jumprelu
Context Size
1,024
Dataset
monology/pile-uncopyrighted
Activation Function
relu
Show All
Embeds
Show Plots
Show Explanation
Show Activations
Show Test Field
Show Steer
Show Link
IFrame
<iframe src="https://www.neuronpedia.org/gemma-2-9b/28-gemmascope-res-16k/8256?embed=true&embedexplanation=true&embedplots=true&embedsteer=true&embedactivations=true&embedlink=true&embedtest=true" title="Neuronpedia" style="height: 300px; width: 540px;"></iframe>
Link
https://www.neuronpedia.org/gemma-2-9b/28-gemmascope-res-16k/8256?embed=true&embedexplanation=true&embedplots=true&embedsteer=true&embedactivations=true&embedlink=true&embedtest=true
Not in Any Lists
Add to List
▼
No Comments
ADD
Negative Logits
he
-0.86
him
-0.71
[]*
-0.69
ông
-0.67
él
-0.67
husband
-0.63
Boyfriend
-0.63
он
-0.63
เขา
-0.62
nele
-0.62
POSITIVE LOGITS
she
1.51
彼女
1.16
그녀
1.09
彼女は
1.00
she
1.00
она
0.94
вона
0.93
Ms
0.90
เธอ
0.90
彼女
0.88
Act
ivations
Density 1.378%
Stacked
Snippet
Full
Show Raw Tokens
Show Formatted
Show Breaks
Hide Breaks
No Known Activations