Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

google/gemma-scope-2-12b-it/resid_post/layer_12_width_262k_l0_medium

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 ак

0.82

ರ್ಧ

0.77

<0x98>

0.75

<0x95>

0.74

ischer

0.74

 effett

0.73

𝘺

0.73

ло

0.72

 allez

0.72

ITION

0.72

POSITIVE LOGITS

şi

0.77

ć

0.76

 Städ

0.75

en

0.73

会导致

0.72

puede

0.71

it

0.71

悝

0.70

看待

0.70

controller

0.70

Activations Density 0.000%

No Known Activations

© Neuronpedia 2026

Privacy & Terms Blog GitHub Slack Twitter Contact