Neuronpedia is the free and open platform for interpretability research. Search, test, explore, and upload your data.
    What's AI Interpretability?
    Today, hundreds of millions of people use AIs like ChatGPT, but nobody, not even the engineers who created them, knows exactly how they think or how to reliably steer them away from harming humans. This is because modern AIs were created through a process similar to evolution, using extremely powerful computers. Understanding how AIs think is the field of interpretability, and steering them is alignment. Both are important for AI safety.
    AI safety? Are you trying to take away my GPUs?
    No. We want to help understand and align extremely powerful AI, since there is no guarantee that the AI will be friendly to humanity.
    Why Neuronpedia?
    The goal of Neuronpedia is to be the "Wikipedia for AI Interpretability". But why is this needed?
    • Automatic Frontend for Your Data: Researchers are busy enough doing research. They shouldn't have to cobble together a frontend to make their work presentable and usable before publishing their paper. Neuronpedia wants to make publishing your interpretability research as easy as clicking Upload.
    • One Central, Standardized, Searchable Database: Interpretability researchers have usually created their own custom websites to upload and display their data. For example, OpenAI's Neuron Viewer and Neel Nanda's Neuroscope host similar types of data, but with totally different interfaces, APIs, and data structures. Neuronpedia is the free and open website where all interpretability data can be uploaded, searched, compared, linked to, exported, and tested.
    • Crowdsourcing and Peer Testing: Neuronpedia is used by (and created by!) people who are fascinated by AI interpretability. Neuronpedia's tools and game allow anyone to contribute to explaining, verifying, and analyzing your data. Someone might even find interesting things in your data that you didn't notice! (And vice versa.)
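    As a sketch of what a standardized, exportable database makes possible, here is a minimal example of working with exported feature data in Python. The URL pattern and the JSON record shape are assumptions for illustration only, not Neuronpedia's documented API.

```python
import json

BASE_URL = "https://www.neuronpedia.org"  # assumed base URL

def feature_url(model: str, layer: str, index: int) -> str:
    """Build a link to one feature page, e.g. gpt2-small@9-res-jb:454.
    The path layout here is an assumption, not a documented route."""
    return f"{BASE_URL}/{model}/{layer}/{index}"

def top_activation_texts(feature_json: str, n: int = 3) -> list[str]:
    """Pull the n highest-activating snippets from a feature record
    in a hypothetical exported-JSON shape."""
    record = json.loads(feature_json)
    acts = sorted(record["activations"],
                  key=lambda a: a["max_act"], reverse=True)
    return [a["text"] for a in acts[:n]]

# A hand-made record in the assumed shape:
sample = json.dumps({
    "model": "gpt2-small",
    "layer": "9-res-jb",
    "index": 454,
    "activations": [
        {"text": "laid-back", "max_act": 4.2},
        {"text": "sold-out", "max_act": 3.9},
        {"text": "grown-up", "max_act": 3.1},
    ],
})

print(feature_url("gpt2-small", "9-res-jb", 454))
print(top_activation_texts(sample, n=2))
```

    Because every record shares one schema, the same few lines work across models and uploaders — the point of a central database rather than per-lab custom sites.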
    Who does Neuronpedia benefit?
    If AI alignment goes well, then Neuronpedia benefits everyone. But most immediately, Neuronpedia benefits interpretability researchers and those who are curious about the inner workings of AI models. As a project that's mostly supported by short-term grants from nonprofits, Neuronpedia is also committed to openness and does not sell data - all data uploaded and contributed is free to use. If you'd like a specific data export, please contact us.
    Do you support directions/features?
    Yes, we do - including support for browsing, viewing, testing, and uploading directions/features. We currently have directions from OpenAI, Joseph Bloom, and Cunningham et al.
    Who are you?
    Neuronpedia is created by Johnny Lin - I'm an ex-Apple engineer who previously founded a privacy startup. AI is fascinating - and the impact it will have should not be underestimated.
    How can I help?
    • Researchers: Use Neuronpedia for your models and directions. Email us and let us know what would be the most useful features for you. APIs? New ways to test neurons? New visualizations? Plugins?
    • Everyone Else: Poke around AI brains. Find cool and interesting neurons. Comment and star them.
    • The Wealthy: Donate to Neuronpedia - don't make us put up a Jimmy Wales-style banner at the top of every neuron.
    The Neuronpedia game (crowdsourced understanding of AI) is currently less actively maintained, as we are focused on building out research features.
    How It Works
    1. Explain
    Crowdsource explanations for the neurons inside AI.
    2. Verify
    Verify explanations that other people have submitted.
    3. Understand AI
    Open data for interpretability, alignment, and safety projects.