Explanations

No Explanations Found

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.19.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

.head        -0.07
 Geschichte  -0.07
Duo          -0.07
 Loki        -0.07
.Username    -0.07
 Kanye       -0.07
 republic    -0.07
 prac        -0.07
 navCtrl     -0.07
cin          -0.07

Positive Logits

 Peterson    0.08
Pet          0.08
 petroleum   0.08
Pet          0.08
[blank]      0.07
ventional    0.07
'%"          0.07
 ?>>↵        0.07
嬰           0.07
'|'          0.07

Activations Density: 0.021%
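
To put the density figure in context, the short sketch below converts it into an expected number of activations per context window. It assumes, and the page does not confirm this, that the density is the fraction of tokens on which the feature has a nonzero activation. The variable names are illustrative only; the two input values are taken from the page.

```python
# Assumption: "Activations Density" is the fraction of tokens on which
# the feature fires (nonzero activation). The page does not define it.
density_percent = 0.021  # Activations Density, as listed on the page
context_size = 1024      # Context Size, as listed in the configuration

density = density_percent / 100.0               # percent -> fraction
expected_per_context = density * context_size   # mean firings per context
tokens_per_activation = 1.0 / density           # mean tokens between firings

print(f"{expected_per_context:.3f} expected activations per context")
print(f"about one activation every {tokens_per_activation:.0f} tokens")
```

Under that assumption the feature fires about 0.215 times per 1,024-token context, or roughly once every 4,762 tokens.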

No Known Activations