INDEX

Explanations

nickel

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_3/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.3.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

/general

-0.07

Forgery

-0.06

SHOP

-0.06

 المج

-0.06

eru

-0.06

 RandomForest

-0.06

lost

-0.06

ertation

-0.06

roken

-0.06

 yıllarda

-0.06

POSITIVE LOGITS

 aluminum

0.07

 Cộng

0.07

 ав

0.06

.records

0.06

oram

0.06

 свід

0.06

steel

0.06

 Passenger

0.06

"]))

0.06

 měly

0.06

Activations Density 0.026%

nickel

No Comments

No Known Activations