OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
Default prompts from the main branch, strategy TokenActivationPair.
Recent Explanations
instructional verb-phrase constructions that direct someone to undertake an action or allocate time/care, especially in advice or step-by-step guidance contexts.
gpt-5
that a little alarming. Take your time pumpkin, no
mathematical and technical formatting markup, particularly LaTeX symbols, asterisks for emphasis, and special characters used in academic or technical documents.