Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

say "}"

np_max-act-logits · gemini-2.5-flash-lite

New Auto-Interp

Configuration

google/gemma-scope-2-27b-it/transcoder_all/layer_56_width_262k_l0_small_affine

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

=",

0.56

0.56

="

0.50

0.50

=<

0.49

$=$

0.48

="+

0.48

|=

0.46

$=\

0.46

+=

0.46

POSITIVE LOGITS

 मुर्

0.38

 ഫോ

0.37

 RoHS

0.37

 TRIG

0.37

曠

0.36

 soluzioni

0.36

 engel

0.36

 बहिष्कार

0.35

 jouent

0.35

 따라

0.35

Activations Density 0.002%

No Known Activations

© Neuronpedia 2026

Privacy & Terms Blog GitHub Slack Twitter Contact