INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

投资

-0.08

工业园

-0.07

	Input

-0.07

 решения

-0.07

顾客

-0.07

 Wohnung

-0.07

陔

-0.06

自如

-0.06

较强的

-0.06

 slightly

-0.06

POSITIVE LOGITS

 stringify

0.08

_proba

0.08

 QModelIndex

0.08

 therefore

0.07

#,

0.07

משפחה

0.07

 Alla

0.07

irteen

0.07

 también

0.06

扪

0.06

Activations Density 0.026%

No Comments

No Known Activations

No Comments

No Known Activations