INDEX

Explanations

seen

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

.NORTH

-0.07

 eaten

-0.07

 reveal

-0.06

STR

-0.06

 seeking

-0.06

 Trying

-0.06

).↵↵↵

-0.06

 вра

-0.06

рег

-0.06

 Ка

-0.06

POSITIVE LOGITS

.logout

0.07

 Bubble

0.06

-sheet

0.06

.baseUrl

0.06

antis

0.06

 mView

0.06

aption

0.06

ValueHandling

0.06

Scheduler

0.06

ผ

0.06

Activations Density 0.329%

seen

No Comments

No Known Activations

seen

No Comments

No Known Activations