© Neuronpedia 2026

Privacy & Terms Blog GitHub Slack Twitter Contact

Neuronpedia

Natural Language

NEW Assistant AxisNEW Circuit TracerUPDATESteer SAE Evals ExportsAPI Community Blog Privacy & Terms Contact

Home
GPT2-Small
Transcoders Residuals
8-TRES-DC
62

INDEX

Explanations

sentence-ending punctuation or periods

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Top Features by Cosine Similarity

Embeds

Show PlotsShow ExplanationShow ActivationsShow Test FieldShow SteerShow Link

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

tube

-0.70

VPN

-0.62

 neighb

-0.61

rieved

-0.61

bath

-0.58

illance

-0.57

etimes

-0.56

rowd

-0.54

outube

-0.53

↑

-0.53

POSITIVE LOGITS

ナ

0.67

 Instrument

0.67

 Inspired

0.65

 Feet

0.63

 Ashes

0.63

 Normally

0.62

ン

0.62

ック

0.62

atra

0.61

 Alright

0.60

Activations Density 0.245%

No Known Activations