Explanations

/copyleft

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.7.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

Classes: -0.08
Pred: -0.07
 بأ: -0.06
 Michel: -0.06
implify: -0.06
再: -0.06
üre: -0.06
	word: -0.06
 fears: -0.06
 halk: -0.06

Positive Logits

/copyleft: 0.08
KeySpec: 0.07
?“↵↵: 0.07
VID: 0.06
ScreenState: 0.06
 Krishna: 0.06
 TObject: 0.06
 commercially: 0.06
 Ezek: 0.06
.findByIdAndUpdate: 0.06

Activations Density 0.001%

No Known Activations