OpenAI's Automated Interpretability from paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models/context windows.
technical syntax and symbol-heavy tokens in code or formatted text, especially XPath expressions like following-sibling and similar structured snippets.
descriptions of artificial intelligence and futuristic science-fiction scenarios, especially space colonization, advanced technology, and formal techno-policy or military-style discourse.
gpt-5
strating* it. After the disastrous early attempts at
scripted multi-speaker dialogue structure, especially speaker turn labels and direct-address conversational turns typical of roleplay or staged conversations.
structural and discourse cues of the model’s step-by-step math solution (e.g., response headers, newlines/section breaks, and procedural lead-ins indicating the start of an explanation).
gpt-5
?<end_of_turn>↵<start_of_turn>model↵We are given two equations