INDEX

Explanations

media

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

粤港澳

-0.07

抢占

-0.07

残留

-0.07

痿

-0.07

留意

-0.07

国产

-0.07

珛

-0.07

要好好

-0.07

 SHOW

-0.07

准入

-0.07

POSITIVE LOGITS

.',

0.07

 ------------------------------------------------------------

0.07

ANDING

0.07

 gent

0.07

身旁

0.07

ael

0.07

Tip

0.06

nection

0.06

()</

0.06

****************************

0.06

Activations Density 0.020%

media

No Comments

No Known Activations