Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

LaTeX mathematical expressions

np_acts-logits-general · gemini-2.5-flash-lite

New Auto-Interp

Configuration

google/gemma-scope-2-27b-pt/resid_post/layer_53_width_65k_l0_medium

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

利亚

0.38

 feria

0.38

🠀

0.38

 कक्कड़

0.37

𐰴

0.37

 सीक्व

0.37

बानी

0.36

)`;

0.36

☚

0.36

 Residents

0.36

POSITIVE LOGITS

\,

0.68

0.66

{\

0.66

_{\

0.62

\,\

0.61

\;

0.58

}%

0.57

$}

0.55

}}

0.54

)}

0.53

Activations Density 0.000%

No Known Activations

© Neuronpedia 2026

Privacy & Terms Blog GitHub Slack Twitter Contact