INDEX

Explanations

Everywhere; among us

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

bbe

-0.06

 Hern

-0.06

_coord

-0.06

Awesome

-0.06

 Bren

-0.06

 Palm

-0.06

 sécurité

-0.06

Iteration

-0.06

Trial

-0.05

 наук

-0.05

POSITIVE LOGITS

 sway

0.07

 warrior

0.07

 использ

0.07

 Fool

0.07

({'

0.07

 clashed

0.07

_]

0.06

UsingEncoding

0.06

 }}"></

0.06

 ArgumentException

0.06

Activations Density 0.166%

Everywhere; among us

No Comments

No Known Activations

Everywhere; among us

No Comments

No Known Activations