INDEX

Explanations

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

_Add

-0.06

_TRACK

-0.06

Album

-0.06

 Kunst

-0.06

stanbul

-0.06

Tou

-0.06

閉

-0.06

itzer

-0.06

 vertically

-0.06

 Walt

-0.06

POSITIVE LOGITS

 produces

0.07

 graduation

0.07

lbs

0.07

\",↵

0.07

 networking

0.07

 exponent

0.07

msgs

0.07

 '"';↵

0.06

 destroys

0.06

HTTPS

0.06

Activations Density 0.000%

more

No Comments

No Known Activations