Explanations

parenthood, motherhood

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Negative Logits

状

-0.07

狀

-0.07

ギ

-0.07

采购

-0.06

αρ

-0.06

 entrepreneurship

-0.06

�i

-0.06

 disobed

-0.06

 weitere

-0.06

ущ

-0.06

Positive Logits

_HAVE

0.07

 Intern

0.07

/Admin

0.06

 schl

0.06

 FLAG

0.06

rem

0.06

 bóng

0.06

 unify

0.06

([]

0.06

_Tr

0.06

Activations Density 0.085%

parenthood, motherhood

No Known Activations
