Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

settings and instructions

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-gpt-oss-20b/resid_post_layer_11/trainer_0

Dataset (Dashboard)

Various

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Qed

-0.08

interpret

-0.07

Matthew

-0.07

 मेल

-0.07

void

-0.07

וח

-0.07

 fills

-0.07

Packed

-0.07

 condol

-0.07

INFRINGEMENT

-0.07

POSITIVE LOGITS

.toggle

0.16

Toggle

0.16

 togg

0.16

 переключ

0.16

 Toggle

0.16

.Toggle

0.16

_toggle

0.16

 toggle

0.15

toggle

0.15

-toggle

0.14

Activations Density 0.010%

No Known Activations

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact