© Neuronpedia 2026

Privacy & Terms Blog GitHub Slack Twitter Contact

Neuronpedia

Natural Language

NEW Assistant AxisNEW Circuit TracerUPDATESteer SAE Evals ExportsAPI Community Blog Privacy & Terms Contact

Home
GPT2-Small
6
1358

INDEX

Explanations

parts before the apostrophe in negative forms of the verb: "hadn" in hadn't, "wouldn" in wouldn't, "wasn" in wasn't

Explanation Uploaded by User

contractions with a negative word, such as "isn't" or "aren't."

oai_token-act-pair · gpt-4-turbo

New Auto-Interp

Top Features by Cosine Similarity

Embeds

Show PlotsShow ExplanationShow ActivationsShow Test FieldShow SteerShow Link

IFrame

Link

Not in Any Lists

No Comments

No Known Activations