Explanations

No Explanations Found

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.15.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

" говорит": -0.07
" strstr": -0.06
" Titles": -0.06
" chir": -0.06
"UMENT": -0.06
" hitter": -0.06
"نتائ": -0.06
" estate": -0.06
" transitioning": -0.06
" Aircraft": -0.06

Positive Logits

"نقل": 0.07
"пром": 0.07
"_PID": 0.07
" مدينة": 0.07
"гре": 0.07
" Cùng": 0.07
"_MD": 0.07
"构成": 0.07
"POW": 0.06
" 请求": 0.06

Activations Density: 0.013%
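
As a rough reading of the density figure, the sketch below converts 0.013% into an expected number of firing tokens per 1,024-token context. It assumes that the density is the fraction of tokens on which the feature has a nonzero activation, which this page does not state. The function names are illustrative only and do not come from any library.

```python
# Minimal sketch: interpret an activation density under the ASSUMPTION that
# it is the fraction of tokens on which the feature is active (nonzero).

def density_fraction(density_percent: float) -> float:
    """Convert a percentage such as 0.013 (%) into a plain fraction."""
    return density_percent / 100.0


def expected_active_tokens(density_percent: float, context_size: int) -> float:
    """Expected number of active tokens in one context of `context_size` tokens."""
    return density_fraction(density_percent) * context_size


def tokens_per_activation(density_percent: float) -> float:
    """Average number of tokens seen per activation (inverse of the fraction)."""
    return 1.0 / density_fraction(density_percent)


if __name__ == "__main__":
    density = 0.013      # percent, from the dashboard
    context = 1024       # tokens, from the dashboard's Context Size
    print(f"expected active tokens per context: {expected_active_tokens(density, context):.3f}")
    print(f"tokens per activation (average):    {tokens_per_activation(density):.0f}")
```

Under that assumption the feature fires on far fewer than one token per context on average, so most contexts would contain no activation at all.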

No Known Activations
