Neuronpedia

Explanations

Golden

np_max-act · gemini-2.0-flash

Configuration

Prompts (Dashboard): 16,384 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Token                 Logit
"ana"                 -1.07
"ANA"                 -0.99
"gap"                 -0.82
" CreateTagHelper"    -0.77
"gap"                 -0.77
"AndEndTag"           -0.75
"OGND"                -0.74
" GOLDEN"             -0.73
"Golden"              -0.71
" Cæsar"              -0.71

Positive Logits

Token                 Logit
(not shown)           0.56
(not shown)           0.53
"Ba"                  0.52
"Lu"                  0.50
"Ad"                  0.49
(not shown)           0.49
"Pan"                 0.48
"Ra"                  0.48
"Ab"                  0.47
"Br"                  0.47

Activations Density 0.281%
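
As a rough sanity check, the density figure can be converted into an approximate token count. This is a minimal sketch that assumes the density is the fraction of dashboard tokens on which the feature has a nonzero activation, and that it was measured over the 16,384 prompts of 128 tokens listed under Configuration. Neither assumption is confirmed by this page, and the variable names are illustrative.

```python
# Figures copied from this page; how they relate is an assumption (see above).
NUM_PROMPTS = 16_384        # prompts in the dashboard run
TOKENS_PER_PROMPT = 128     # tokens per prompt
DENSITY_PERCENT = 0.281     # reported activation density, in percent

# Total tokens the dashboard would have processed.
total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT

# Approximate number of tokens with a nonzero activation, assuming the
# density was computed over exactly these tokens.
approx_active_tokens = round(total_tokens * DENSITY_PERCENT / 100)

print(f"total tokens: {total_tokens}")
print(f"approx. active tokens: {approx_active_tokens}")
```

Under those assumptions the feature would fire on roughly 5,893 of 2,097,152 tokens, or about one token in every 356.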

No Known Activations

© Neuronpedia 2025