INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

(lbl

-0.07

	sound

-0.07

\Abstract

-0.07

precation

-0.07

גרפי

-0.07

 asphalt

-0.07

 przez

-0.07

 Shirt

-0.07

Eat

-0.07

 screams

-0.07

POSITIVE LOGITS

铊

0.07

(",")↵

0.07

 Trail

0.07

💗

0.07

😍

0.06

带

0.06

.get

0.06

主力

0.06

client

0.06

.persistence

0.06

Activations Density 0.002%

No Comments

No Known Activations