Neuronpedia

© Neuronpedia 2025

EXPLANATION TYPE

oai_token-act-pair

Description

OpenAI's Automated Interpretability, from the paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models and context windows.

Author

OpenAI

URL

https://github.com/hijohnnylin/automated-interpretability

Settings

Default prompts from the main branch, using the TokenActivationPair strategy.
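
To make the TokenActivationPair strategy concrete, here is a minimal sketch of how a text excerpt might be rendered for the explainer model: each token is paired with its activation, discretized to an integer from 0 to 10, and written one pair per line. The function names, the tab separator, and the sample values are illustrative assumptions, not code taken from the repository above; the real prompts add further instructions and few-shot examples.

```python
import math
from typing import List

MAX_SCALE = 10  # assumed discretization range (0..10)


def normalize_activations(activations: List[float], max_activation: float) -> List[int]:
    """Clip negatives to 0 and scale activations to integers in 0..MAX_SCALE."""
    if max_activation <= 0:
        return [0] * len(activations)
    return [
        min(MAX_SCALE, math.floor(MAX_SCALE * max(0.0, a) / max_activation))
        for a in activations
    ]


def format_token_activation_pairs(
    tokens: List[str], activations: List[float], max_activation: float
) -> str:
    """Render one 'token<TAB>activation' pair per line (hypothetical layout)."""
    if len(tokens) != len(activations):
        raise ValueError("tokens and activations must have the same length")
    normalized = normalize_activations(activations, max_activation)
    return "\n".join(f"{tok}\t{act}" for tok, act in zip(tokens, normalized))


# Made-up activations for a snippet resembling one of the excerpts listed below.
example = format_token_activation_pairs(
    [" type", " of", " hat", " they"], [0.0, 0.5, 4.0, 1.0], max_activation=4.0
)
print(example)
```

The explainer model sees blocks like this for the top-activating excerpts and is asked to summarize in a short phrase what the feature responds to, which is the kind of text shown under Recent Explanations.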

Recent Explanations

formal, institutional language—especially abstract nouns and titles tied to legal or religious authority, often accompanied by concessive transitions like nonetheless or nevertheless

gpt-5

of the option; it nonetheless retains its essential characteristic as

0-GEMMASCOPE-MLP-16K

mentions of hats, especially the exact word or its plural, including when embedded in longer terms or phrases.

gpt-5

, and the type of hat they were wearing. When

7-TRANSCODER-HP

publication years and four-digit dates, especially within academic citations and reference metadata.

gpt-5

. 2020;24:2

23-TRANSCODER-HP

occurrences of the character sequence “hy” (especially as a capitalized prefix or standalone token) within words and names.

gpt-5

templateid1\listhybrid{\listlevel\

23-TRANSCODER-HP

formal legal case captions and appellate court headers indicating jurisdiction, parties, and orders in U.S. court documents.

gpt-5

94th Judicial District Court↵ Dallas County,

23-TRANSCODER-HP

capitalized proper names, especially surnames and eponymous terms, appearing in technical or news text.

gpt-5

assessment included the Wechsler Intelligence Scale for Children-third

7-TRANSCODER-HP

references to immediate family relationships, especially mentions of parents and their children.

gpt-5

little about the horrors their parents witnessed or perpetrated." You

23-TRANSCODER-HP

phrases that describe something as occurring in or derived from nature, typically using an adjective before a noun in scientific or technical contexts.

gpt-5

8 in the second had natural heart valves, while

23-TRANSCODER-HP

verbs indicating concrete actions taken by someone (often the author) to do, create, or try something, especially in technical/problem‑solving contexts.

gpt-5

a bash, then I wrote a simple bash like this

23-TRANSCODER-HP

mentions of the English language or “en” locale in language/locale metadata and related labels.

gpt-5

rizione:↵↵Language: English . Brand New Book.

23-TRANSCODER-HP

references to the lungs and pulmonary system in medical or anatomical contexts.

gpt-5

reduce your risk.↵↵Lung Cancer Causes Without Smoking↵↵

7-TRANSCODER-HP

forms of the verb “to be” (including contracted forms) used as auxiliaries or copulas.

gpt-5

We can guess that she's been craving them, but

7-TRANSCODER-HP

references to constructing or developing something, especially in “to build” verb phrases across technical or organizational contexts.

gpt-5

learn essential skills needed to build apps for Android.↵↵The

7-TRANSCODER-HP

mentions of software UI tab navigation, often activating on the three-letter sequence appearing alone or embedded within longer terms.

gpt-5

in a new window or tab and exceptions – opens in

7-TRANSCODER-HP

questions or statements about whether something has ever occurred (lifetime experience, including negations like “not at any time”).

gpt-5

rate. But have you ever wondered just how the financial

7-TRANSCODER-HP

superlative-degree constructions, particularly phrases indicating something is among the top or best within a category.

gpt-5

’s voice remains one of the strongest. In an article

7-TRANSCODER-HP

uses of the adjective denoting completeness/entirety, including capitalized occurrences as part of proper names or titles.

gpt-5

. One aspect of the total toilet training process is the

7-TRANSCODER-HP

references to outcomes or findings—mentions of the result of an action, experiment, query, or study.

gpt-5

. Marketing and innovation produce results, all the rest are

7-TRANSCODER-HP

mentions of “profile” and closely related profile-page or profile-metadata terms, especially in account, biographical, or listing contexts.

gpt-5

. To ensure reliability, profile information of each included article

11-GEMMASCOPE-RES-16K

references to complex rhythmic structures in music theory.

gpt-5

syncopation, complex polyrhythms,

23-RESID-POST-AA