Explanations

ized

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Negative Logits

\modules

-0.07

/Z

-0.07

 forIndexPath

-0.07

 учрежд

-0.06

 truthful

-0.06

mue

-0.06

 traitement

-0.06

 adul

-0.06

 scient

-0.06

нение

-0.06

Positive Logits

Р

0.07

 Waters

0.07

iked

0.06

	properties

0.06

 Smile

0.06

 Carson

0.06

JA

0.06

 Barrett

0.06

SERVER

0.05

DROP

0.05

Activations Density

0.001%

No Known Activations