INDEX

Explanations

dear

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Mitch

-0.07

 слово

-0.07

Nothing

-0.07

}")

-0.07

 два

-0.07

 Kostenlos

-0.06

 clinically

-0.06

 Health

-0.06

↵

-0.06

 technically

-0.06

POSITIVE LOGITS

 dear

0.10

Dear

0.09

 Dear

0.09

 quer

0.07

ears

0.07

-Mail

0.07

Pear

0.06

rial

0.06

<dim

0.06

 Pear

0.06

Activations Density 0.008%

dear

No Comments

No Known Activations