INDEX

Explanations

propose

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

SEX

-0.07

LCD

-0.07

!..

-0.07

 CENTER

-0.07

 literally

-0.07

 vivid

-0.06

ukkit

-0.06

 sound

-0.06

led

-0.06

Lex

-0.06

POSITIVE LOGITS

 proposed

0.18

 propose

0.15

 proposing

0.14

 Proposed

0.14

 proposes

0.12

 proposal

0.10

 Proposal

0.10

 proposals

0.09

placer

0.08

Proposal

0.08

Activations Density 0.023%

propose

No Comments

No Known Activations

propose

No Comments

No Known Activations