Explanations

ogen

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various

Features: 131,072

Data Type: float32

Hook Name: blocks.11.hook_resid_post

Architecture: standard

Context Size: 1,024

Dataset: monology/pile-uncopyrighted
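
For readers who want to carry these settings into a script, the configuration above could be captured roughly as follows. This is only a sketch: the class name `SaeConfig`, its field names, and the `layer()` helper are hypothetical and not part of any library; the values are copied from the listing.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class SaeConfig:
    # Field names are hypothetical; values are copied from the listing above.
    release: str = "andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1"
    n_features: int = 131_072
    dtype: str = "float32"
    hook_name: str = "blocks.11.hook_resid_post"
    architecture: str = "standard"
    context_size: int = 1_024
    dataset: str = "monology/pile-uncopyrighted"

    def layer(self) -> int:
        """Extract the block index from a hook name shaped like 'blocks.<n>.<hook>'."""
        match = re.fullmatch(r"blocks\.(\d+)\..+", self.hook_name)
        if match is None:
            raise ValueError(f"unexpected hook name: {self.hook_name!r}")
        return int(match.group(1))


config = SaeConfig()
print(config.layer(), config.n_features)
```

The hook name and the release path both point at the same place in the model, the residual stream after block 11, which is what `layer()` recovers.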

Negative Logits

"קור": -0.07
"风": -0.07
"additional": -0.06
" geen": -0.06
" Tara": -0.06
" categorized": -0.06
".ylabel": -0.06
"/common": -0.06
"Website": -0.06
"AC": -0.06

Positive Logits

" lesbian": 0.08
"*******": 0.07
"jam": 0.07
"_def": 0.07
" Desired": 0.07
" malign": 0.07
" newState": 0.07
" GetString": 0.06
"ของเรา": 0.06
"lse": 0.06
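
The page does not say how these logit lists are produced. A common convention for SAE dashboards, assumed here, is to project the feature's decoder direction through the model's unembedding matrix and report the tokens with the lowest and highest resulting values. The sketch below illustrates that idea on toy data; `top_logit_tokens`, the four-token vocabulary, and the matrices are all made up for illustration and have nothing to do with the real model weights.

```python
import numpy as np


def top_logit_tokens(decoder_dir, unembed, vocab, k=3):
    """Return (most negative, most positive) k tokens by logit contribution.

    decoder_dir: (d_model,) feature direction in the residual stream.
    unembed:     (d_model, vocab_size) unembedding matrix.
    """
    logits = decoder_dir @ unembed      # shape: (vocab_size,)
    order = np.argsort(logits)          # indices sorted by ascending logit
    negative = [(vocab[i], float(logits[i])) for i in order[:k]]
    positive = [(vocab[i], float(logits[i])) for i in order[::-1][:k]]
    return negative, positive


# Toy example: 2-dimensional residual stream, 4-token vocabulary.
vocab = ["a", "b", "c", "d"]
unembed = np.array([[1.0, -1.0, 0.5, 0.0],
                    [0.0, 1.0, -3.0, 1.0]])
decoder_dir = np.array([1.0, 0.5])

negative, positive = top_logit_tokens(decoder_dir, unembed, vocab, k=2)
print(negative)  # [('c', -1.0), ('b', -0.5)]
print(positive)  # [('a', 1.0), ('d', 0.5)]
```

Under this reading, the values listed for this feature (all within about ±0.08) would indicate only a weak direct effect on any single output token.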

Activations Density: 0.002%
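
To put the density figure in perspective, the sketch below assumes that "density" means the fraction of tokens on which the feature has a nonzero activation; the page itself does not define the term. The helper `activation_density` is hypothetical, and the arithmetic uses only the 0.002% density and the 1,024-token context size reported here.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds the threshold."""
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Toy check: one firing position out of four.
toy_density = activation_density([0.0, 0.0, 1.5, 0.0])

# Arithmetic on the reported figures.
density = 0.002 / 100                   # 0.002% expressed as a fraction
tokens_per_activation = 1 / density     # about 50,000 tokens per firing
expected_per_context = density * 1024   # about 0.02 firings per 1,024-token context

print(toy_density, round(tokens_per_activation), round(expected_per_context, 5))
```

Under that assumption the feature fires on roughly one token in 50,000, or about once in every 49 full-length contexts.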

ogen

No Known Activations
