INDEX

Explanations

-

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Discount

-0.06

rin

-0.06

人类

-0.06

nir

-0.06

-factor

-0.06

(factor

-0.06

 drowning

-0.06

ras

-0.06

Ich

-0.06

/hr

-0.06

POSITIVE LOGITS

 Listed

0.08

.Raw

0.07

 изображ

0.07

_season

0.06

 diseñador

0.06

(mt

0.06

_unsigned

0.06

_UNSIGNED

0.06

removeClass

0.06

↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵

0.06

Activations Density 0.004%

-

No Comments

No Known Activations