Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

`__` in code context

np_acts-logits-general · gemini-2.5-flash-lite

New Auto-Interp

Configuration

google/gemma-scope-2-4b-it/resid_post/layer_29_width_262k_l0_medium

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 *****

0.98

◆

0.97

\*

0.96

0.96

☆

0.86

＊

0.86

 .......

0.85

 ******

0.84

)}}\

0.84

¡

0.84

POSITIVE LOGITS

__)

1.69

//}

1.51

__,

1.45

__.

1.44

/)

1.44

__()

1.38

__

1.36

__(

1.35

__":

1.34

__["

1.32

Activations Density 0.191%

No Known Activations

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact