Explanations

such

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.15.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

" semaphore"  -0.06
".monitor"  -0.06
"uckle"  -0.06
" argument"  -0.06
".types"  -0.06
" breakdown"  -0.06
" State"  -0.06
"ETYPE"  -0.06
" plastics"  -0.06
".stem"  -0.06

Positive Logits

"----------------"  0.07
"fr"  0.07
"(_,"  0.07
" daunting"  0.06
"“↵↵"  0.06
"essed"  0.06
" rodents"  0.06
"_both"  0.06
" spolupráci"  0.06
" clue"  0.06

Activations Density 0.042%

No Known Activations