INDEX

Explanations

5

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

PlotsExplanationShow Test FieldDefault Test Text

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Present

-0.07

 goTo

-0.07

้แก

-0.07

 territory

-0.06

겁

-0.06

liced

-0.06

Bid

-0.06

 Bols

-0.06

े।

-0.06

 invaded

-0.06

POSITIVE LOGITS

 DispatchQueue

0.07

-det

0.06

/arm

0.06

\Cache

0.06

 realise

0.06

に向

0.06

主

0.06

ud

0.06

 filmmakers

0.06

시는

0.06

Activations Density 0.001%

5

No Comments

No Known Activations