INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_23/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.23.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

鲐

-0.07

:text

-0.07

derive

-0.07

丰田

-0.07

科技园

-0.07

 Tommy

-0.07

<dyn

-0.07

redirectToRoute

-0.07

Tony

-0.07

莶

-0.07

POSITIVE LOGITS

דוגמא

0.07

还是比较

0.06

 guaranteed

0.06

 perm

0.06

 apparel

0.06

 citation

0.06

itations

0.06

 participación

0.06

原标题

0.06

 councill

0.06

Activations Density 0.066%

No Comments

No Known Activations

No Comments

No Known Activations