INDEX

Explanations

al

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_23/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.23.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

vergence

-0.07

ker

-0.07

оны

-0.07

Lem

-0.06

Gö

-0.06

terr

-0.06

�

-0.06

Urb

-0.06

pp

-0.06

 OrderedDict

-0.06

POSITIVE LOGITS

Manchester

0.07

 Manchester

0.07

					↵					↵

0.07

aghan

0.06

 νεφοκ

0.06

.Millisecond

0.06

.btnSave

0.06

"@"

0.06

\Image

0.06

`,`

0.06

Activations Density 0.021%

al

No Comments

No Known Activations

al

No Comments

No Known Activations