INDEX

Explanations

Informal/personal conversations

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Kens

-0.07

attering

-0.07

qry

-0.06

 desea

-0.06

lé

-0.06

$j

-0.06

 Kathryn

-0.06

ref

-0.06

 Bates

-0.06

 seguridad

-0.06

POSITIVE LOGITS

FromFile

0.07

eyer

0.06

/py

0.06

امت

0.06

Il

0.06

iyas

0.06

oha

0.06

olithic

0.06

 Infer

0.06

 diffé

0.06

Activations Density 0.062%

Informal/personal conversations

No Comments

No Known Activations