Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

multiple languages

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-gpt-oss-20b/resid_post_layer_11/trainer_0

Dataset (Dashboard)

Various

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

<|channel|>

-0.08

Walter

-0.08

ench

-0.07

[]↵

-0.07

gyr

-0.07

Ta

-0.07

 પ્રમાણે

-0.07

Taking

-0.07

मा

-0.07

 Kreuz

-0.07

POSITIVE LOGITS

」

0.10

"，

0.10

」という

0.10

」の

0.10

”

0.09

」，

0.09

”的

0.09

<|reserved_200004|>

0.09

」を

0.09

》，

0.08

Activations Density 1.992%

No Known Activations

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact