INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

áo

-0.07

外国

-0.07

BTC

-0.07

 appliance

-0.07

Axe

-0.07

 polls

-0.07

وى

-0.07

 annoyed

-0.07

-long

-0.07

 rounding

-0.07

POSITIVE LOGITS

 ()=>{↵

0.07

{*

0.07

 граф

0.07

.RequestMethod

0.07

 chois

0.06

 hungry

0.06

會員註冊

0.06

碚

0.06

 Corinth

0.06

蒐集

0.06

Activations Density 0.035%

No Comments

No Known Activations

No Comments

No Known Activations