Neuronpedia
© Neuronpedia 2025
EXPLANATION TYPE
oai_token-act-pair
Description
OpenAI's Automated Interpretability, from the paper "Language models can explain neurons in language models". Modified by Johnny Lin to support additional models and context windows.
Author
OpenAI
URL
https://github.com/hijohnnylin/automated-interpretability
Settings
Default prompts from the main branch, using the TokenActivationPair strategy.
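As a rough illustration of what a token-activation-pair prompt might contain, the sketch below discretizes raw activations to an integer scale and renders one token per line next to its level. The 0-10 scale, the tab separator, and the function names are assumptions made for this example; the repository linked above is the authority on the actual prompt format.

```python
import math
from typing import Sequence

MAX_LEVEL = 10  # assumed discretization scale (0..10), for illustration only


def normalize_activations(activations: Sequence[float], max_activation: float) -> list[int]:
    """Map raw activations to integers in 0..MAX_LEVEL; negative values become 0."""
    if max_activation <= 0:
        return [0 for _ in activations]
    return [
        min(MAX_LEVEL, math.floor(MAX_LEVEL * max(0.0, a) / max_activation))
        for a in activations
    ]


def format_token_activation_pairs(
    tokens: Sequence[str], activations: Sequence[float], max_activation: float
) -> str:
    """Render one 'token<TAB>level' line per token (hypothetical layout)."""
    if len(tokens) != len(activations):
        raise ValueError("tokens and activations must have the same length")
    levels = normalize_activations(activations, max_activation)
    return "\n".join(f"{tok}\t{lvl}" for tok, lvl in zip(tokens, levels))


if __name__ == "__main__":
    # Made-up activation values, purely to show the shape of the output.
    print(format_token_activation_pairs(["what", "i", "said"], [0.0, 0.5, 2.0], 2.0))
```

An explainer model would then be asked to summarize which tokens carry the highest levels; the explanations listed on this page are the results of that kind of request.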
Recent Explanations
product or service descriptions highlighting features, options, or offerings.
claude-3-7-sonnet-20250219
for mang os providing enhanced features for ↵ * area triggers
GEMMA-2-9B-IT
20-AXBENCH-REFT-R1-RES-16K
INDEX 2740
the beginning of explanatory statements, particularly those starting with "This" in technical documentation.
claude-3-7-sonnet-20250219
p 2 / pool . This is because when installing it
GPT2-SMALL
1-MLP-OAI
INDEX 7599
The neuron detects forms of the word "said" when reporting speech or quotations.
claude-3-7-sonnet-20250219
Lo L : what i said before is what people said
GPT2-SMALL
1-TRES-DC
INDEX 4046
lists within a document, especially when formatted with indentation or bullet points.
claude-3-7-sonnet-20250219
us l ( 200 2 )↵ Blue Moon – Als Wer
LLAMA3.1-8B
25-LLAMASCOPE-RES-32K
INDEX 270
descriptions of group violence or aggressive actions.
claude-3-5-sonnet-20240620
↵↵ His shocked fellow soldiers raised the alarm and paramedics ,
GEMMA-2B-IT
12-RES-JB
INDEX 1765
violent physical altercations or assaults.
claude-3-5-haiku-20241022
↵↵ His shocked fellow soldiers raised the alarm and paramedics ,
GEMMA-2B-IT
12-RES-JB
INDEX 1765
descriptions of group violence or aggression where people attack, beat, or threaten someone.
claude-3-7-sonnet-20250219
↵↵ His shocked fellow soldiers raised the alarm and paramedics ,
GEMMA-2B-IT
12-RES-JB
INDEX 1765
prepositions like "of" and "from" followed by materials or physical substances.
claude-3-7-sonnet-20250219
a variety of metals - including platinum and gold .
GPT2-SMALL
9-TRES-DC
INDEX 622
online forum or comment-style text that includes informal personal statements or opinions.
claude-3-7-sonnet-20250219
6 . ↵↵ You guy make it sound like the end
GEMMA-2-9B
20-RES-MATRYOSHKA-DC
INDEX 7415
technical and scientific terminology, particularly related to technological systems and devices.
claude-3-5-haiku-20241022
light source 1 is focused as a linear image near
GEMMA-2-9B
20-RES-MATRYOSHKA-DC
INDEX 383
details related to time periods, particularly years and dates in the format of 4-digit years (e.g. 1998, 2005) or specific dates (e.g. May 5, 2005).
claude-3-5-sonnet-20240620
light source 1 is focused as a linear image near
GEMMA-2-9B
20-RES-MATRYOSHKA-DC
INDEX 383
technical descriptions or specifications in formal documents, especially focusing on numerical measurements and technical terminology.
claude-3-7-sonnet-20250219
light source 1 is focused as a linear image near
GEMMA-2-9B
20-RES-MATRYOSHKA-DC
INDEX 383
The neuron activates for punctuation marks, especially periods at the end of paragraphs or sentences.
claude-3-7-sonnet-20250219
for the half -w itted . The decision of one ,
LLAMA3.1-8B
5-LLAMASCOPE-RES-32K
INDEX 2791
complete declarative sentences that state facts or describe information.
claude-3-7-sonnet-20250219
required scope and quality targets . They are instrumental in ensuring
GEMMA-2B-IT
12-RES-JB
INDEX 2297
fragments of text from online forums, blogs, and comments, particularly in casual conversation contexts.
claude-3-7-sonnet-20250219
with even the t ini est peripheral devices . The cable
LLAMA3.1-8B
29-LLAMASCOPE-MLP-131K
INDEX 5026
phrases introducing reported speech or claims, particularly using "it appears" or "it seems".
claude-3-5-haiku-20241022
. ↵↵ Unfortunately , it appears that the Polish minister does
GEMMA-2B
10-RES-JB
INDEX 6688
the phrase "it appears" at the beginning of sentences.
claude-3-7-sonnet-20250219
. ↵↵ Unfortunately , it appears that the Polish minister does
GEMMA-2B
10-RES-JB
INDEX 6688
academic research-related concepts and terminology.
claude-3-7-sonnet-20250219
communities continues to motivate ecological research [@ pone . 0 0
GEMMA-2-9B-IT
20-AXBENCH-REFT-R1-RES-16K
INDEX 1195
C-style programming keywords and code-related terminology.
claude-3-7-sonnet-20250219
B SL A _MAY BE _UNUSED double cube Factor ,↵
DEEPSEEK-R1-DISTILL-LLAMA-8B
15-LLAMASCOPE-SLIMPJ-OPENR1-RES-32K
INDEX 777
descriptions of unusual or unexpected sightings or encounters, particularly related to mysterious objects or creatures.
claude-3-5-sonnet-20240620
driver when suddenly he encountered a explosion along the side of
LLAMA3.1-8B
30-LLAMASCOPE-RES-32K
INDEX 8441
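Each entry above ends with three lines: a model name, a source (layer and SAE set) name, and an "INDEX n" line. A minimal sketch of turning those three lines into a structured reference follows. The FeatureRef class, the lowercasing, and the model/source/index path layout are hypothetical choices for this example, not a documented Neuronpedia interface.

```python
from dataclasses import dataclass

INDEX_PREFIX = "INDEX "


@dataclass(frozen=True)
class FeatureRef:
    """Hypothetical container for one feature reference from the listing."""
    model: str
    source: str
    index: int


def parse_feature_ref(model_line: str, source_line: str, index_line: str) -> FeatureRef:
    """Parse the three trailing lines of an entry into a FeatureRef."""
    index_line = index_line.strip()
    if not index_line.startswith(INDEX_PREFIX):
        raise ValueError(f"expected a line starting with {INDEX_PREFIX!r}, got {index_line!r}")
    return FeatureRef(
        model=model_line.strip().lower(),    # lowercasing is an assumption
        source=source_line.strip().lower(),  # lowercasing is an assumption
        index=int(index_line[len(INDEX_PREFIX):]),
    )


def feature_path(ref: FeatureRef) -> str:
    """Join the parts into a model/source/index path (assumed layout)."""
    return f"{ref.model}/{ref.source}/{ref.index}"


if __name__ == "__main__":
    ref = parse_feature_ref("GPT2-SMALL", "1-MLP-OAI", "INDEX 7599")
    print(ref)
    print(feature_path(ref))
```

Grouping entries by such a reference makes it easy to see where several explainer models described the same feature, as with GEMMA-2B-IT / 12-RES-JB / INDEX 1765 above.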