© Neuronpedia 2026


Neuronpedia

Andy Arditi · GPT-OSS BatchTopK SAEs
GPT-OSS-20B
Resid Post - 131k
11-RESID-POST-AA
Index 30607

Explanations

,

np_max-act · gemini-2.0-flash


Configuration

andyrdt/saes-gpt-oss-20b/resid_post_layer_11/trainer_0

Dataset (Dashboard)

Various

No Configuration Found


Negative Logits

Token (quoted; a leading space is part of the token)  Logit
" يتعلق"  -0.08
"声明"  -0.08
" девушки"  -0.08
"驻"  -0.08
"ಬ್ಬ"  -0.08
" hazırlan"  -0.08
" แขวง"  -0.08
"ක්ෂ"  -0.07
" recipiente"  -0.07
" милл"  -0.07

Positive Logits

Token (quoted; a leading space is part of the token)  Logit
"secure"  0.07
"sn"  0.07
" Burton"  0.07
"plus"  0.06
"очно"  0.06
" Lind"  0.06
"jo"  0.06
"icc"  0.06
"Arithmetic"  0.06
" critically"  0.06

Activations Density 1.363%
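
Activation density is usually read as the fraction of sampled tokens on which the feature is active. A minimal sketch under that assumption follows; the helper `activation_density`, the zero threshold, and the sample sizes are hypothetical, since the page does not describe how the 1.363% figure was measured.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds the threshold."""
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for value in activations if value > threshold)
    return active / len(activations)


# Toy example: one active position out of four.
toy_density = activation_density([0.0, 0.0, 1.2, 0.0])

# Illustrative only: 1,363 active positions out of 100,000 sampled tokens
# would give the density shown on this page. The real sample size is unknown.
illustrative = activation_density([1.0] * 1363 + [0.0] * (100000 - 1363))
```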

No Known Activations