Explanations

No Explanations Found

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.15.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

Token          Logit
"Parallel"     -0.07
" computer"    -0.06
"/files"       -0.06
" Wikipedia"   -0.06
"Lock"         -0.06
"Enter"        -0.06
" Warranty"    -0.06
"Further"      -0.06
"هر"           -0.06
"Distinct"     -0.06

Positive Logits

Token          Logit
" Robertson"   0.08
"抗战"         0.07
" enqu"        0.07
"防晒"         0.07
" dass"        0.07
"trg"          0.07
"ange"         0.07
"(LogLevel"    0.07
" ratt"        0.06
" QVector"     0.06

Activations Density 0.005%

No Known Activations
