Explanations

No Explanations Found

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.19.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits (token: logit; tokens are quoted to preserve leading spaces)

"รวบ": -0.08
"Bell": -0.08
" necklace": -0.07
" transformer": -0.07
"诊断": -0.07
" noodles": -0.07
"整改": -0.07
" display": -0.06
"dec": -0.06
" squeezing": -0.06

Positive Logits (token: logit; tokens are quoted to preserve leading spaces)

" titular": 0.08
" çalışmalar": 0.08
"achu": 0.07
".STRING": 0.07
"的各种": 0.07
"/Register": 0.07
" FLOAT": 0.07
";)": 0.07
" sust": 0.07
"筹集": 0.07

Activations Density: 0.061%
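
To put the density figure in perspective, the sketch below converts it into an expected number of activations per context window. It assumes that the density is the fraction of tokens on which the feature fires, which this page does not state explicitly. The function name `expected_activations` is hypothetical and used only for illustration. The two constants are copied from the values listed on this page.

```python
# Figures copied from this page.
DENSITY_PERCENT = 0.061  # "Activations Density", in percent
CONTEXT_SIZE = 1024      # "Context Size", in tokens


def expected_activations(density_percent: float, n_tokens: int) -> float:
    """Expected number of tokens on which the feature fires.

    Assumption (not stated on the page): density is the per-token
    firing fraction, expressed as a percentage.
    """
    return density_percent / 100.0 * n_tokens


# Expected activations within a single full-length context.
per_context = expected_activations(DENSITY_PERCENT, CONTEXT_SIZE)

# Average number of tokens between two activations.
tokens_per_activation = 100.0 / DENSITY_PERCENT

print(f"{per_context:.2f} expected activations per {CONTEXT_SIZE}-token context")
print(f"about one activation every {tokens_per_activation:.0f} tokens")
```

Under that assumption the feature would fire less than once per full context on average, which suggests a sparse feature.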

No Known Activations