Explanations

percentage

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.7.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

 confusion	-0.06
 의해	-0.06
APolynomial	-0.06
 Trey	-0.06
ียน	-0.06
Suk	-0.06
 Arsenal	-0.06
sơ	-0.06
ur	-0.05
bson	-0.05

Positive Logits

 setError	0.07
percentage	0.07
 heartbeat	0.07
YouTube	0.07
unction	0.07
ोश	0.07
-Identifier	0.06
lığının	0.06
 nơi	0.06
 compares	0.06

Activations Density 0.003%
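
To put the density figure in perspective, here is a minimal sketch that assumes "activations density" means the fraction of sampled tokens on which the feature fires (activation above zero). The function name `activation_density`, the threshold, and the sample data are hypothetical illustrations; the dashboard does not state how its figure is computed.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its feature activation
    is strictly greater than the threshold (0.0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 3 tokens out of 100,000.
acts = [0.0] * 100_000
for i in (10, 5_000, 99_999):
    acts[i] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.003%
```

Under that assumption, a density of 0.003% corresponds to roughly 3 active tokens per 100,000, i.e. a very sparse feature.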

No Known Activations