Explanations

No Explanations Found

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.19.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

Token         Logit
"fits"        -0.07
"/tr"         -0.07
"Cow"         -0.07
"/control"    -0.06
"рож"         -0.06
"므로"        -0.06
"]))↵"        -0.06
"-points"     -0.06
" Beatles"    -0.06
"等活动"      -0.06

Positive Logits

Token             Logit
"折叠"            0.07
" obstruction"    0.07
" актив"          0.07
" labeling"       0.07
"_encoder"        0.07
" predict"        0.06
" prophet"        0.06
" skins"          0.06
" obstruct"       0.06
"渑"              0.06

Activations Density 0.041%
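
To put the density figure in perspective, the short sketch below converts it into an expected number of activations per context window. It assumes that "activation density" means the fraction of tokens on which the feature is non-zero, which the page itself does not state; the variable names are illustrative only.

```python
# Assumption: activation density = fraction of tokens where the feature fires.
density = 0.041 / 100   # 0.041% from the dashboard
context_size = 1024     # Context Size from the configuration

# Expected number of firing tokens in one full context window.
expected_per_context = density * context_size

# Average number of tokens between two activations.
tokens_per_activation = 1 / density

print(f"{expected_per_context:.2f} activations per {context_size}-token context")
print(f"about 1 activation every {tokens_per_activation:.0f} tokens")
```

Under that assumption the feature fires on fewer than one token per context on average, which is consistent with a sparse feature.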

No Known Activations