Explanations

No Explanations Found

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.19.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

" ServiceException"    -0.07
"纵向"    -0.07
"ꦆ"    -0.07
"(datas"    -0.07
" coordinator"    -0.07
" Israelis"    -0.07
" Friendship"    -0.07
" Classification"    -0.07
" widening"    -0.07
"(nextProps"    -0.07

Positive Logits

"Little"    0.06
"锂"    0.06
"食"    0.06
"uglify"    0.06
"رام"    0.06
"adol"    0.06
"아"    0.06
"()↵↵"    0.06
" знать"    0.06
"應"    0.06

Activations Density 0.011%
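
To put the density figure in perspective, here is a minimal sketch that assumes "density" means the fraction of sampled tokens on which the feature's activation is non-zero (the page itself does not define the term). The function name `activation_density` and the toy activations are illustrative only.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds the threshold."""
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy sample (assumed): 100,000 tokens with 11 of them active,
# which matches a density of 0.011%.
acts = [0.0] * 100_000
for i in range(11):
    acts[i * 9000] = 1.0

density = activation_density(acts)
tokens_per_activation = 1 / density  # average spacing between firings
```

Under that assumption, 0.011% corresponds to the feature firing on roughly one token in every 9,000.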

No Known Activations
