© Neuronpedia 2026

Privacy & Terms Blog GitHub Slack Twitter Contact

Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

Home
Andy Arditi · GPT-OSS BatchTopK SAEs
GPT-OSS-20B
Resid Post - 131k
11-RESID-POST-AA
78277

INDEX

Explanations

具体

np_max-act · gemini-2.0-flash

New Auto-Interp

Top Features by Cosine Similarity

Configuration

andyrdt/saes-gpt-oss-20b/resid_post_layer_11/trainer_0

Dataset (Dashboard)

Various

No Configuration Found

Embeds

Show PlotsShow ExplanationShow ActivationsShow Test FieldShow SteerShow Link

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Lonely

-0.08

 Beaches

-0.08

uc

-0.07

 Rita

-0.07

icais

-0.07

 출장

-0.07

택

-0.07

 Stanley

-0.07

 Daisy

-0.07

 vastly

-0.07

POSITIVE LOGITS

ніше

0.08

 details

0.08

 detall

0.07

落实

0.07

 detal

0.07

 التفاصيل

0.07

 detailing

0.07

 detail

0.07

�

0.07

俗

0.07

Activations Density 0.008%

No Known Activations