© Neuronpedia 2026

Privacy & Terms Blog GitHub Slack Twitter Contact

Neuronpedia

Natural Language

NEW Assistant AxisNEW Circuit TracerUPDATESteer SAE Evals ExportsAPI Community Blog Privacy & Terms Contact

Home
GPT2-Small
Transcoders Residuals
8-TRES-DC
28

INDEX

Explanations

repetitive concepts and actions

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Top Features by Cosine Similarity

Embeds

Show PlotsShow ExplanationShow ActivationsShow Test FieldShow SteerShow Link

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

foundland

-0.62

 incl

-0.61

 facult

-0.61

-0.60

iosyncr

-0.58

ppo

-0.58

ife

-0.57

][/

-0.57

osi

-0.57

 Zeit

-0.57

POSITIVE LOGITS

sung

0.77

AS

0.63

 LAPD

0.59

otin

0.59

monds

0.58

 Myanmar

0.58

growth

0.57

prises

0.56

UC

0.55

vati

0.55

Activations Density 0.057%

No Known Activations