OpenAI's Automated Interpretability, from the paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
the neuron responds to content-bearing or topical words (important nouns, verbs, pronouns and discourse markers) rather than function or filler tokens.
gpt-5-mini
in response to the ever-changing demands of the modern
the neuron highlights salient, information-dense tokens—important content words (main verbs, nouns, numbers) and emphatic punctuation that carry the core facts or claims.
gpt-5-mini
U.S. prisoners have been released from North Korea