INDEX

Explanations

dog products

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_27/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.27.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

sets

-0.07

error

-0.07

.python

-0.07

 COMPUTER

-0.07

that

-0.07

"You

-0.06

------------------------------

-0.06

**(

-0.06

SET

-0.06

 recalled

-0.06

POSITIVE LOGITS

ableOpacity

0.08

 Sabb

0.08

การพ

0.07

 ممن

0.06

ATL

0.06

-learning

0.06

 hodnocení

0.06

 Fill

0.06

 hoog

0.05

 europ

0.05

Activations Density 0.023%

dog products

No Comments

No Known Activations