INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

⬢

-0.09

.setDefault

-0.07

LogFile

-0.07

 Capcom

-0.07

cal

-0.07

くなりました

-0.07

 חוד

-0.06

abic

-0.06

وبة

-0.06

.Arrays

-0.06

POSITIVE LOGITS

相关

0.07

STAR

0.07

宣传

0.07

ﳕ

0.07

addle

0.06

 []↵↵↵

0.06

 제품

0.06

banner

0.06

(Player

0.06

 manifested

0.06

Activations Density 0.031%

No Comments

No Known Activations

No Comments

No Known Activations