Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

google/gemma-scope-2-4b-it/transcoder_all/layer_5_width_262k_l0_small_affine

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

𝑎

2.26

⎢

2.15

𝑏

2.13

 unleashed

2.13

𝑑

1.99

𝑠

1.97

 loosely

1.95

্টি

1.94

้

1.93

ीकरण

1.91

POSITIVE LOGITS

ד

2.01

et

2.00

に

1.89

getting

1.87

ﻨ

1.83

 distinguishing

1.81

giveness

1.81

色列

1.80

ට

1.79

joh

1.79

Activations Density 0.000%

No Known Activations

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact