Neuronpedia
Get Started
API
Releases
Jump To
Search
Models
Circuit Tracer
NEW
Steer
SAE Evals
Blog
Slack
Privacy & Terms
Contact
Sign In
© Neuronpedia 2025
Privacy & Terms
Blog
GitHub
Slack
Twitter
Contact
Home
GPT2-Small
6
507
Prev
Next
MODEL
6
INDEX
Go
Explanations
words like "until", "but", "when", "before", and to a lesser extent the words they're contrasting
Explanation Uploaded by User
No Scores
the words'until' or 'when'
Explanation Uploaded by User
No Scores
" when", "who", and similar
Explanation Uploaded by User
No Scores
shifts in personal circumstances
Explanation Uploaded by User
No Scores
continuity and contradiction
Explanation Uploaded by User
No Scores
Description of a change from a prior state
Explanation Uploaded by User
No Scores
pivotal moments defining identity
Explanation Uploaded by User
No Scores
adverbs or qualifiers
Explanation Uploaded by User
No Scores
transition or shift
Explanation Uploaded by User
No Scores
personal descriptions and people's identities.
oai_token-act-pair · gpt-4-turbo
No Scores
New Auto-Interp
AutoInterp Type
claude-3-5-haiku-20241022
Generate
Top Features by Cosine Similarity
Embeds
Plots
Explanation
Show Test Field
Default Test Text
IFrame
<iframe src=https://www.neuronpedia.org/gpt2-small/6/507?embed=true&embedexplanation=true&embedplots=true&embedtest=true" title="Neuronpedia" style="height: 300px; width: 540px;"></iframe>
Link
https://www.neuronpedia.org/gpt2-small/6/507?embed=true&embedexplanation=true&embedplots=true&embedtest=true
Not in Any Lists
Add to List
▼
No Comments
ADD
Stacked
Snippet
Full
Show Breaks
Hide Breaks
No Known Activations