INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

сроч

-0.08

frau

-0.08

リスト

-0.08

.findall

-0.08

 queryString

-0.07

 pharmacist

-0.07

商务

-0.07

_FIN

-0.07

 filenames

-0.07

(cli

-0.07

POSITIVE LOGITS

 Evidence

0.07

 cheers

0.07

🐷

0.07

ump

0.07

#@

0.06

 incredible

0.06

nty

0.06

онт

0.06

 wing

0.06

 eget

0.06

Activations Density 0.393%

No Comments

No Known Activations