INDEX

Explanations

has

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 tvrd

-0.07

 Natasha

-0.06

оратив

-0.06

诚

-0.06

ev

-0.06

 urlpatterns

-0.06

då

-0.05

leme

-0.05

(){↵↵

-0.05

 Dialog

-0.05

POSITIVE LOGITS

 QTimer

0.08

 Blow

0.07

OutOfBounds

0.07

 lief

0.07

 Amsterdam

0.07

Generating

0.07

sterdam

0.06

_slave

0.06

μερα

0.06

_while

0.06

Activations Density 0.428%

has

No Comments

No Known Activations

has

No Comments

No Known Activations