Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

comments in code

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-gpt-oss-20b/resid_post_layer_11/trainer_0

Dataset (Dashboard)

Various

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 исслед

-0.08

]);↵↵

-0.08

 Kenntnisse

-0.08

 시간

-0.08

 pesquisadores

-0.08

 phosphorus

-0.07

iril

-0.07

 oste

-0.07

EXT

-0.07

 গবেষ

-0.07

POSITIVE LOGITS

ದೆ

0.08

氏

0.08

¦

0.08

ස

0.08

�

0.08

共

0.08

�

0.08

 njega

0.08

ходзіць

0.08

ेव

0.08

Activations Density 0.000%

No Known Activations

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact