Explanations

Master

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.7.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

" Benz"           -0.07
"if"              -0.07
"Feb"             -0.07
" göl"            -0.07
" nearly"         -0.07
"yny"             -0.07
"Coy"             -0.07
" Cheney"         -0.06
" enthusiastic"   -0.06
".Images"         -0.06

Positive Logits

" Master"    0.19
" master"    0.17
"Master"     0.16
" Masters"   0.15
"master"     0.15
" masters"   0.14
" MASTER"    0.14
"-master"    0.11
"MASTER"     0.11
"masters"    0.11

Activations Density 0.017%

No Known Activations