INDEX

Explanations

FACE

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_27/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.27.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 reusable

-0.07

currentUser

-0.06

kur

-0.06

 titleLabel

-0.06

iếm

-0.06

 Renaissance

-0.06

 onemocnění

-0.06

 variety

-0.06

 childbirth

-0.06

 cita

-0.06

POSITIVE LOGITS

 Wohn

0.07

.AddScoped

0.07

_BEGIN

0.06

usi

0.06

PPP

0.06

ensored

0.06

berger

0.06

ssl

0.06

Charsets

0.06

DevExpress

0.06

Activations Density 0.095%

FACE

No Comments

No Known Activations

FACE

No Comments

No Known Activations