INDEX

Explanations

technical language

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 broaden

-0.07

 peaks

-0.06

 entrega

-0.06

Crow

-0.06

 outra

-0.06

 Horny

-0.06

 Sections

-0.06

牙

-0.06

 Mothers

-0.06

ucción

-0.06

POSITIVE LOGITS

Logged

0.07

 FStar

0.07

 CharSet

0.07

での

0.07

ErrorResponse

0.06

 commission

0.06

 RelativeLayout

0.06

:animated

0.06

}↵↵↵

0.06

 *);↵↵

0.06

Activations Density 0.342%

technical language

No Comments

No Known Activations

technical language

No Comments

No Known Activations