INDEX

Explanations

Cle

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_27/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.27.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Map

-0.07

Donald

-0.07

 shock

-0.07

 Pulse

-0.06

-service

-0.06

 ImageView

-0.06

 vascular

-0.06

 tone

-0.06

iov

-0.06

./

-0.06

POSITIVE LOGITS

 свет

0.07

빙

0.06

lernen

0.06

 yarış

0.06

 nilai

0.06

bindParam

0.06

 Georg

0.06

cle

0.06

 outset

0.06

UIAlertAction

0.06

Activations Density 0.003%

Cle

No Comments

No Known Activations

Cle

No Comments

No Known Activations