INDEX

Explanations

simple

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_3/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.3.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 jamais

-0.07

hiro

-0.07

スタ

-0.06

 plunder

-0.06

hid

-0.06

 roman

-0.06

нимать

-0.06

 leur

-0.06

 Richmond

-0.06

 Andreas

-0.06

POSITIVE LOGITS

 simple

0.09

simple

0.09

.simple

0.09

Simple

0.09

_simple

0.08

_SIMPLE

0.08

_easy

0.07

.Simple

0.07

/simple

0.07

	Simple

0.07

Activations Density 0.022%

simple

No Comments

No Known Activations