INDEX

Explanations

business, customers, services

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_27/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.27.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Spi

-0.06

には

-0.06

 constitution

-0.06

 Pavel

-0.06

他們

-0.06

-mean

-0.06

ูช

-0.06

/settings

-0.06

 anesthesia

-0.06

/System

-0.06

POSITIVE LOGITS

かい

0.07

($.

0.06

 sane

0.06

 READY

0.06

 awful

0.06

emie

0.06

ไล

0.06

.fade

0.06

 sonra

0.06

 coached

0.06

Activations Density 0.263%

business, customers, services

No Comments

No Known Activations

business, customers, services

No Comments

No Known Activations