Neuronpedia

Explanations

closing brackets

np_acts-logits-general · gemini-2.5-flash-lite

Configuration

google/gemma-scope-2-4b-it/resid_post/layer_29_width_65k_l0_medium
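The source identifier above appears to pack several fields into one slash-separated path. A minimal sketch of splitting it, assuming the layout is organization / release / hook point / layer_<n>_width_<size>_l0_<level>. That layout is inferred only from the single string shown here, not from Neuronpedia documentation, and parse_source_id is a hypothetical helper:

```python
import re


def parse_source_id(source_id: str) -> dict:
    """Split a slash-separated SAE source path into labeled fields.

    The four-part layout is an assumption inferred from one example string.
    """
    org, release, hook, variant = source_id.split("/")
    # Assumed variant pattern: layer_<int>_width_<size>_l0_<level>
    match = re.fullmatch(r"layer_(\d+)_width_(\w+?)_l0_(\w+)", variant)
    if match is None:
        raise ValueError(f"unexpected variant format: {variant!r}")
    layer, width, l0 = match.groups()
    return {
        "org": org,
        "release": release,
        "hook": hook,
        "layer": int(layer),
        "width": width,
        "l0": l0,
    }


info = parse_source_id(
    "google/gemma-scope-2-4b-it/resid_post/layer_29_width_65k_l0_medium"
)
print(info)
```

Identifiers that do not follow the assumed pattern raise a ValueError rather than being silently mis-parsed.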

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Negative Logits

Token              Logit
<start_of_image>   1.76
')));              0.76
<h2>               0.72
')))               0.70
</h2>              0.67
\"                 0.67
。《               0.66
'));               0.64
 "));              0.63
\"{                0.63
Positive Logits

Token              Logit
(not recovered)    5.07
],                 4.59
].                 4.33
](                 4.32
]:                 4.30
(not recovered)    4.29
.]                 4.28
];                 4.27
']                 4.15
!]                 4.05
Activations Density 0.888%
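For a sense of scale, the density can be combined with the dashboard prompt counts listed above. The sketch below assumes that density is the fraction of token positions on which the feature activates, and that it was measured over the same 238,145 prompts of 512 tokens each. Neither assumption is stated on this page, so the result is only a rough estimate.

```python
# Figures copied from this page.
NUM_PROMPTS = 238_145
TOKENS_PER_PROMPT = 512
DENSITY_PERCENT = 0.888

# Total token positions in the dashboard prompt set.
total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT

# Assumption: density = active token positions / total token positions.
estimated_active = round(total_tokens * DENSITY_PERCENT / 100)

print(f"total token positions: {total_tokens:,}")
print(f"estimated active positions: {estimated_active:,}")
```

Under these assumptions the feature would fire on roughly one token position in every 113.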

No Known Activations

© Neuronpedia 2025