Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

google/gemma-scope-2-1b-pt/resid_post/layer_13_width_16k_l0_medium

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

trom

1.74

يكم

1.67

্যাথ

1.64

ي

1.57

ნიშვნ

1.49

raten

1.48

يك

1.44

了

1.44

rond

1.44

tal

1.41

POSITIVE LOGITS

𝕤

1.80

 peaches

1.78

 locate

1.59

 eyelids

1.54

ⓝ

1.54

pie

1.53

𝕟

1.51

 fairy

1.50

 beautiful

1.49

 nonexistent

1.49

Activations Density 0.000%

No Known Activations

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact