Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

google/gemma-scope-2-1b-pt/resid_post/layer_13_width_16k_l0_medium

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

NO

1.78

!!

1.70

!!!

1.60

1.52

 !!!!

1.48

1.47

:-

1.37

 !!!!!

1.35

1.35

::

1.35

POSITIVE LOGITS

ແລະ

1.94

ették

1.89

 পাওয়

1.80

ल्यावर

1.77

ând

1.73

nię

1.73

annya

1.66

पहरण

1.65

ंती

1.64

และ

1.64

Activations Density 0.000%

No Known Activations

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact