INDEX

Explanations

)

np_max-act · gemini-2.0-flash

section and metadata headers that signal Wikipedia/encyclopedia-style article structure

oai_token-act-pair · gpt-5 Triggered by @vetterc0

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

of

-0.07

to

-0.07

 Morning

-0.06

abby

-0.06

cca

-0.06

 pInfo

-0.06

ыџN

-0.06

[axis

-0.06

POSITIVE LOGITS

)↵↵

0.11

.↵↵

0.11

↵↵

0.11

 //
↵
↵

0.11

↵↵

0.11

(){
↵
↵

0.10

).↵↵

0.10

)")↵↵

0.10

:↵↵

0.10

".↵↵

0.10

Activations Density 1.241%

)

section and metadata headers that signal Wikipedia/encyclopedia-style article structure

No Comments

No Known Activations