INDEX

Explanations

you

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 targeting

-0.06

.pk

-0.06

 escalated

-0.06

 toplam

-0.06

>]

-0.06

 kepada

-0.06

 modificar

-0.06

ução

-0.06

 immersed

-0.06

 sidl

-0.05

POSITIVE LOGITS

 Daniel

0.07

essaging

0.07

Tro

0.07

assel

0.06

 Authorization

0.06

élé

0.06

 Điều

0.06

 Monter

0.06

buff

0.06

Activations Density 0.001%

you

No Comments

No Known Activations