INDEX

Explanations

Beginning of documents

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 rewards

-0.07

Tro

-0.07

Tro

-0.06

 Tube

-0.06

Fee

-0.06

Ara

-0.06

베

-0.06

');//

-0.06

 Truth

-0.06

Pro

-0.06

POSITIVE LOGITS

(contents

0.07

(\"

0.07

RYPT

0.07

chin

0.06

aylor

0.06

_PLATFORM

0.06

 minOccurs

0.06

 quilt

0.06

-il

0.06

-selected

0.06

Activations Density 0.011%

Beginning of documents

No Comments

No Known Activations

Beginning of documents

No Comments

No Known Activations