INDEX

Explanations

an

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

-trained

-0.07

 Circus

-0.07

 Biden

-0.06

 british

-0.06

_filtered

-0.06

 honor

-0.06

sticks

-0.06

ckeditor

-0.06

 ACTIONS

-0.06

 стали

-0.06

POSITIVE LOGITS

(correct

0.07

ูล

0.06

.curr

0.06

овор

0.06

;}

0.06

 borrower

0.06

 RedirectTo

0.06

Dados

0.06

(proxy

0.06

DEPEND

0.06

Activations Density 0.000%

an

No Comments

No Known Activations