INDEX

Explanations

in

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Hib

-0.08

>()

-0.06

 aggregates

-0.06

.toolStripButton

-0.06

stock

-0.06

light

-0.06

ATK

-0.06

ib

-0.06

 держав

-0.06

/**

-0.06

POSITIVE LOGITS

(result

0.07

 ),↵↵

0.07

 experimented

0.06

čas

0.06

_suffix

0.06

(Page

0.06

 specifics

0.06

供

0.06

_relative

0.06

 {:?}",

0.06

Activations Density 0.075%

in

No Comments

No Known Activations