Neuronpedia

Explanations

No Explanations Found

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.19.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

oreach  -0.07
힝  -0.07
ered  -0.06
ѽ  -0.06
unable  -0.06
град  -0.06
hem  -0.06
(".");↵  -0.06
 STILL  -0.06
Me  -0.06

Positive Logits

便利  0.07
嫁给  0.06
🐁  0.06
(NULL  0.06
WOW  0.06
 compounded  0.06
옂  0.06
	reply  0.06
cpf  0.06
 lies  0.06

Activations Density 0.343%

No Known Activations

© Neuronpedia 2025