OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
Neuron 1: looks for actions or phrases in present participle form describing ongoing, dynamic activity.
Neuron 2: looks for terms related to medical conditions or health risks.
Neuron 3: looks for phrases about a sense of community and collective togetherness.
Neuron 4: looks for content that promotes engagement and activism through social media and citizen journalism.
gpt-5-nano
an African-American student of stealing his brother’s jacket.↵
conversational openings and direct questions—often about identity/definitions or expressing worry about crime victimization—across both English and Chinese.