Explanations

No Explanations Found

Configuration: andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.15.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

Token (quoted to preserve leading spaces)    Logit
"White"        -0.08
"all"          -0.08
" сумму"       -0.08
"/New"         -0.07
"choice"       -0.07
"idas"         -0.07
"component"    -0.07
"_pow"         -0.07
"jej"          -0.07
"降低"         -0.07

Positive Logits

Token (quoted to preserve leading spaces)    Logit
"集装箱"                0.08
"仓库"                  0.08
"难民"                  0.08
"台阶"                  0.07
"YG"                    0.07
"缝隙"                  0.07
".RegularExpressions"   0.07
" uluslararası"         0.07
"只需要"                0.07
" nuances"              0.07

Activations Density: 0.079%

No Known Activations