Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

Calculations

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-gpt-oss-20b/resid_post_layer_15/trainer_0

Dataset (Dashboard)

Various

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

:↵↵

-0.11

:↵↵↵

-0.11

:↵↵↵↵

-0.10

.↵↵↵

-0.09

/list

-0.08

">↵↵↵

-0.08

 {↵↵↵

-0.08

—and

-0.08

.↵↵↵↵

-0.08

.*↵↵

-0.07

POSITIVE LOGITS

”；

0.09

"):

0.09

'):

0.09

 هذا

0.09

طانيا

0.09

 المنام

0.09

_Component

0.08

 בעוד

0.08

 honetan

0.08

Afr

0.08

Activations Density 0.289%

No Known Activations

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact