Neuronpedia



Explanations

closing superscript tags

np_acts-logits-general · gemini-2.5-flash-lite


Configuration: google/gemma-scope-2-4b-it/resid_post/layer_29_width_65k_l0_medium

Prompts (Dashboard): 238,145 prompts, 512 tokens each

Dataset (Dashboard): lmsys + oasst1


Negative Logits

Token      Value
![](       0.85
}^{*}\     0.79
}^{*},     0.79
}^\        0.79
$\\        0.77
}^{(       0.77
-\\        0.76
}^         0.76
]\\        0.75
]-\        0.73

Positive Logits

Token      Value
</sup>     2.01
"]}        1.16
</u>       1.06
"]},       1.01
</span>    0.85
</h3>      0.83
</h4>      0.82
']}        0.79
*/}        0.79
"}         0.77
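
The positive-logit listing above can be checked against the explanation "closing superscript tags" with a minimal sketch. The token/value pairs are copied from this page. The `CLOSING_TAG` pattern and the `closing_tags` helper are hypothetical names introduced only for illustration; they are not part of Neuronpedia or of any auto-interp pipeline.

```python
import re

# Positive-logit tokens and values copied from the dashboard listing.
positive_logits = [
    ("</sup>", 2.01),
    ('"]}', 1.16),
    ("</u>", 1.06),
    ('"]},', 1.01),
    ("</span>", 0.85),
    ("</h3>", 0.83),
    ("</h4>", 0.82),
    ("']}", 0.79),
    ("*/}", 0.79),
    ('"}', 0.77),
]

# Hypothetical pattern (an assumption for this sketch): a token counts as an
# HTML closing tag if it is exactly "</name>".
CLOSING_TAG = re.compile(r"^</[A-Za-z][A-Za-z0-9]*>$")


def closing_tags(pairs):
    """Return the (token, value) pairs whose token is an HTML closing tag."""
    return [(tok, val) for tok, val in pairs if CLOSING_TAG.match(tok)]


# Token with the largest positive logit value.
top_token, top_value = max(positive_logits, key=lambda pair: pair[1])

# Subset of the listing that consists of HTML closing tags.
html_closers = closing_tags(positive_logits)
```

In this listing `top_token` is `</sup>`, and five of the ten tokens are HTML closing tags; the remaining five end quoted strings, brackets, braces, or comments.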

Activations Density 0.099%

No Known Activations

© Neuronpedia 2026
