INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Importer

-0.08

,\↵

-0.08

 nominee

-0.08

 stump

-0.07

 neurotrans

-0.07

 depos

-0.07

 bookmark

-0.07

 пользу

-0.07

 avoids

-0.07

 Impossible

-0.07

POSITIVE LOGITS

相传

0.06

歌声

0.06

怎么说

0.06

簡�

0.06

晃

0.06

jective

0.06

Ɓ

0.06

 math

0.06

下列

0.06

"&

0.06

Activations Density 0.224%

No Comments

No Known Activations

No Comments

No Known Activations