© Neuronpedia 2026

Privacy & Terms Blog GitHub Slack Twitter Contact

Neuronpedia

Natural Language

NEW Assistant AxisNEW Circuit TracerUPDATESteer SAE Evals ExportsAPI Community Blog Privacy & Terms Contact

GPT2-SMALL · 8-TRES-DC · 248 ｜ Neuronpedia

Home
GPT2-Small
Transcoders Residuals
8-TRES-DC
248

INDEX

Explanations

references to online platforms and social media interactions

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Top Features by Cosine Similarity

Embeds

Show PlotsShow ExplanationShow ActivationsShow Test FieldShow SteerShow Link

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ippery

-0.80

BuyableInstoreAndOnline

-0.73

htaking

-0.68

 twilight

-0.68

erville

-0.67

estones

-0.67

abus

-0.66

apple

-0.65

 waning

-0.65

emonium

-0.65

POSITIVE LOGITS

ESE

0.79

eez

0.70

oS

0.68

IGN

0.67

OTO

0.65

ASE

0.65

INST

0.64

ribe

0.64

CHAT

0.64

 edit

0.64

Activations Density 2.695%

No Known Activations