© Neuronpedia 2026

Privacy & Terms Blog GitHub Slack Twitter Contact

Neuronpedia

Jacobian LensNEW

Natural Language

NEW Assistant AxisNEW Circuit TracerUPDATESteer SAE Evals ExportsAPI Community Blog Privacy & Terms Contact

Home
Under Peer Review · Sparse Autoencoders for Pythia-70M-Deduped
Pythia-70M-Deduped
MLP Post
0-MLP-SM
27352

INDEX

Explanations

rhetorical questions

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Top Features by Cosine Similarity

Configuration

ctigges/pythia-70m-deduped__mlp-sm_processed/0-mlp-sm

Prompts (Dashboard)

32,768 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

32,768

Data Type

torch.float32

Hook Name

blocks.0.hook_mlp_out

Hook Layer

0

Architecture

standard

Context Size

128

Dataset

EleutherAI/the_pile_deduplicated

Activation Function

relu

Embeds

Show PlotsShow ExplanationShow ActivationsShow Test FieldShow SteerShow Link

IFrame

Link

Not in Any Lists

No Comments

Neuron Alignment

Index

Value

% of L₁

78

+0.13

0.7%

511

+0.12

0.7%

466

+0.12

0.7%

Correlated Neurons

Index

P. Corr.

Cos Sim.

133

+0.13

0.02

309

+0.12

0.02

27

+0.12

0.02

Negative Logits

¹

-2.81

¯

-2.44

Ń

-2.43

®

-2.42

¤

-2.41

¸

-2.39

³

-2.36

¶

-2.36

½

-2.32

¼

-2.31

POSITIVE LOGITS

ioned

1.77

oons

1.66

oon

1.55

chen

1.53

outh

1.50

ourt

1.39

omology

1.37

eros

1.35

ionate

1.34

sed

1.33

Activations Density 0.005%

No Known Activations