Explanations

age

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

"老实": -0.07
"failed": -0.07
"illos": -0.07
"更好": -0.07
"到底": -0.07
"习近平": -0.07
"Sob": -0.06
" akin": -0.06
" Pitt": -0.06
" وجه": -0.06

Positive Logits

"HS": 0.08
"Processes": 0.08
" //////////////////////////////////////////////////////////////////////": 0.07
"unci": 0.07
" Events": 0.07
" Bulgarian": 0.07
" Frame": 0.07
"(option": 0.07
"Daemon": 0.07
"ทร": 0.07

Activations Density 0.002%

No Known Activations