Explanations

No Explanations Found

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.19.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted
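
The configuration values above can be captured in a small Python mapping for use in scripts. This is a minimal sketch: the dictionary keys and the helper `layer_from_hook` are hypothetical names chosen for illustration, not part of any library's schema. The only values used are the ones listed above.

```python
import re

# Values copied from the configuration listing; the key names are
# illustrative assumptions, not an official schema.
sae_config = {
    "sae_id": "andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1",
    "features": 131_072,
    "dtype": "float32",
    "hook_name": "blocks.19.hook_resid_post",
    "architecture": "standard",
    "context_size": 1_024,
    "dataset": "monology/pile-uncopyrighted",
}


def layer_from_hook(hook_name: str) -> int:
    """Extract the block index from a name shaped like 'blocks.<N>.hook_resid_post'."""
    match = re.fullmatch(r"blocks\.(\d+)\.hook_resid_post", hook_name)
    if match is None:
        raise ValueError(f"unexpected hook name: {hook_name!r}")
    return int(match.group(1))


layer = layer_from_hook(sae_config["hook_name"])
```

Here the hook name and the `resid_post_layer_19` segment of the identifier point at the same layer, which is a quick consistency check when copying values from a page like this one.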

Negative Logits

BOSE    -0.08
 وهذه    -0.07
Ț    -0.07
 nelle    -0.07
 çek    -0.07
 tenía    -0.07
 alunos    -0.07
quí    -0.07
 있어    -0.07
 Então    -0.07

Positive Logits

()↵    0.08
 upsetting    0.08
}.    0.07
读者    0.07
.Handled    0.07
 Javascript    0.07
 dashed    0.07
.parameter    0.06
 rounding    0.06
(.    0.06

Activations Density 0.000%

No Known Activations
