© Neuronpedia 2026
Neuronpedia
Model: Gemma-3-12B
Source/SAE: 24-gemmascope-2-res-16k
Index: 10802
Explanations
interrogative pronouns and punctuation
np_acts-logits-general · gemini-2.5-flash-lite
No Scores
This neuron fires on the very first word of each new comment or paragraph, marking the start of a block of text.
oai_token-act-pair · o4-mini
Triggered by @jyhe0408
No Scores
direct address to a person by name, typically at the start of a statement or sentence.
oai_token-act-pair · claude-4-5-sonnet
Triggered by @jyhe0408
No Scores
direct address (vocative) constructions, especially names or addressees set off by a following comma.
oai_token-act-pair · gpt-5
Triggered by @jyhe0408
No Scores
Configuration
google/gemma-scope-2-12b-pt/resid_post/layer_24_width_16k_l0_medium
Prompts (Dashboard): 392,802 prompts, 256 tokens each
Dataset (Dashboard): monology/pile-uncopyrighted
Embeds
IFrame
<iframe src="https://www.neuronpedia.org/gemma-3-12b/24-gemmascope-2-res-16k/10802?embed=true&embedexplanation=true&embedplots=true&embedsteer=true&embedactivations=true&embedlink=true&embedtest=true" title="Neuronpedia" style="height: 300px; width: 540px;"></iframe>
Link
https://www.neuronpedia.org/gemma-3-12b/24-gemmascope-2-res-16k/10802?embed=true&embedexplanation=true&embedplots=true&embedsteer=true&embedactivations=true&embedlink=true&embedtest=true
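The embed link above follows a visible pattern: a model/source/index path followed by `embed=true` and one `embed…` flag per panel. A minimal Python sketch for rebuilding it is shown here. The helper name `build_embed_url` and the use of `false` to hide a panel are assumptions made for illustration; only the all-`true` form appears on this page.

```python
from urllib.parse import urlencode

BASE = "https://www.neuronpedia.org"

# Flag names and their order are copied from the link above.
EMBED_FLAGS = [
    "embedexplanation",
    "embedplots",
    "embedsteer",
    "embedactivations",
    "embedlink",
    "embedtest",
]


def build_embed_url(model, source, index, **overrides):
    """Hypothetical helper: rebuild the embed URL for one feature.

    Every panel flag defaults to "true". Passing e.g. embedsteer=False
    writes "false" instead, which is an assumed way to hide a panel
    and is not confirmed by this page.
    """
    params = {"embed": "true"}
    for flag in EMBED_FLAGS:
        enabled = overrides.get(flag, True)
        params[flag] = "true" if enabled else "false"
    return f"{BASE}/{model}/{source}/{index}?{urlencode(params)}"


url = build_embed_url("gemma-3-12b", "24-gemmascope-2-res-16k", 10802)
print(url)
```

With the defaults, the result matches the link shown above character for character, so the same string can be dropped into the `src` attribute of the iframe snippet.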
Negative Logits
DIEGO  0.66
।  0.65
்த  0.63
ﻬ  0.60
।  0.58
😹  0.58
MICHAEL  0.57
س  0.57
ח  0.57
ㅎㅎ  0.57
Positive Logits
你需要  0.77
你可以  0.75
you  0.71
你怎么  0.71
你在  0.70
你  0.70
你说  0.69
এর  0.68
为你  0.67
你自己  0.66
Activations
Density 0.051%
No Known Activations