Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

cities

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

mwhanna/qwen3-4b-transcoders/layer_23.safetensors

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

163,840

Data Type

float32

Hook Name

blocks.23.mlp.hook_in

Architecture

transcoder

Context Size

8,192

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ardi

-0.27

—that

-0.27

pedia

-0.25

esda

-0.25

','$

-0.25

',{↵

-0.24

sip

-0.24

ARDS

-0.24

æī£éĻ¤

-0.24

ernote

-0.23

POSITIVE LOGITS

0.60

:t

0.44

:<?

0.42

:T

0.42

:L

0.40

:(

0.40

:!

0.39

:S

0.38

ï¼ļ

0.38

:B

0.38

Activations Density 0.004%

No Known Activations

© Neuronpedia 2026

Privacy & Terms Blog GitHub Slack Twitter Contact