Explanations

date/time differences (np_max-act · gemini-2.0-flash)

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.15.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

_activation    -0.07
enaries        -0.07
.products      -0.07
(clazz         -0.07
tips           -0.07
 اشاره         -0.06
 Marxist       -0.06
 Drops         -0.06
 annotation    -0.06
_OCC           -0.06

Positive Logits

<>("           0.06
⌒              0.06
�              0.06
 사망          0.06
Выб            0.06
 teasing       0.06
,’             0.06
ey             0.06
 tast          0.06
 unnatural     0.06

Activations Density 0.019%

No Known Activations