OpenAI's Automated Interpretability, from the paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models and context windows.
This neuron broadly detects common English function words, especially auxiliary and connective tokens such as “is,” “to,” “would,” “if,” “it,” and “but.”
This neuron activates strongly on tokens containing the substring “ote,” whether at the end of a word or inside it (e.g. epimastigote, creosote, heterozygote).
This neuron detects instances of reported speech or dialogue attribution—especially sentences that begin with a pronoun (He/She) introducing what someone says or does, often marked by a colon.
o4-mini
beautiful: He tells her he loves her and makes her
The neuron fires on programming‐style identifiers—especially CamelCase API or framework names and properties (e.g. ModelState, IsValid, navigationItem, TempData).
o4-mini
it possible to apply ModelState.IsValid to just one
The neuron selectively detects the occurrence of the term “response” (especially in phrases like “response of” or “response to”) in clinical or experimental treatment contexts.
o4-mini
q/liter. The response of the serum potassium level