Explanations

amphetamine

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

Logit   Token
-0.07   " ---------------------------------------------------------------------------↵"
-0.07   " Brittany"
-0.07   "更"
-0.07   " trúc"
-0.07   "*******************************************************************************/↵"
-0.07   " Theresa"
-0.07   "变迁"
-0.07   " **/↵"
-0.07   "getValue"
-0.06   " Guerrero"

Positive Logits

Logit   Token
0.08    "乡镇"
0.07    " inability"
0.07    " большим"
0.07    "ической"
0.07    "Connector"
0.07    "-checkbox"
0.07    "arrow"
0.06    "党委"
0.06    "-minded"
0.06    "늙"

Activations Density: 0.001%

No Known Activations
