INDEX

Explanations

0

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

时代的

-0.07

_DIR

-0.07

.Extensions

-0.07

.coroutines

-0.07

 amnesty

-0.06

_visit

-0.06

 telefon

-0.06

只要你

-0.06

 conceivable

-0.06

 RoundedRectangleBorder

-0.06

POSITIVE LOGITS

wła

0.07

である

0.07

езд

0.07

 modelName

0.07

.df

0.07

ホール

0.07

הול

0.07

ass

0.06

 Equivalent

0.06

骣

0.06

Activations Density 0.002%

0

No Comments

No Known Activations

0

No Comments

No Known Activations