Neuronpedia


GPT2-Small · 6 · Index 1000

Explanations

Tokens that strongly identify the source as a science fiction or fantasy media review. Selects for the word "favorite" (x5) as an indicator of a review.

Explanation Uploaded by User

words and phrases related to ratings, evaluations, and comparisons.

oai_token-act-pair · gpt-4-turbo

