OpenAI's Automated Interpretability, from the paper "Language models can explain neurons in language models". Modified by Johnny Lin to add new models and context windows.
unusual or malformed tokens and structural irregularities within text content.
claude-4-5-haiku
jquery.com/jquery-3.5.1
structured data elements and list entries, particularly names and values within JSON, tables, or numbered lists.
claude-4-5-haiku
},↵{↵"name": "
model response headers or instructional content markers across multiple languages.
claude-4-5-haiku
ktronikprojekte:** Experimentieren mit Sensoren,
specific automobile model years, designations, and their associated color or visual characteristics.
claude-4-5-haiku
11S, finished in Signal Orange, should be
user prompts requesting customization, role-playing, or instructions on how the model should behave.
claude-4-5-haiku
ultuda yanıtla lütfen."↵↵**Ancak unut
This neuron activates on terms related to resignations, particularly when details like names, dates, or reasons for the resignation are mentioned.
gemini-2.5-flash
**June17,2023
bolded idiomatic expressions and example phrases highlighted for definitional or instructional purposes.
claude-4-5-haiku
7, with someone always **on duty**."↵
dates formatted as MM/DD/YYYY.
deepseek-v3
:↵A) Useahighalpha
numeric values and date/time-related tokens that appear in structured documents or formatted lists.