Explanations

warnings or profanity

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted
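
The configuration values above can be gathered into a single record for use in scripts. The following is a minimal sketch only: the class name `SaeConfig`, its field names, and the `layer` helper are hypothetical and do not come from any particular library. The helper also assumes hook names follow the `blocks.<layer>.<hook>` pattern seen in this listing.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class SaeConfig:
    """Values copied from the configuration listing; field names are hypothetical."""
    sae_id: str
    hook_name: str
    n_features: int
    dtype: str
    architecture: str
    context_size: int
    dataset: str

    def layer(self) -> int:
        # Assumption: hook names look like "blocks.<layer>.<hook>".
        match = re.fullmatch(r"blocks\.(\d+)\.\w+", self.hook_name)
        if match is None:
            raise ValueError(f"unexpected hook name: {self.hook_name!r}")
        return int(match.group(1))


config = SaeConfig(
    sae_id="andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1",
    hook_name="blocks.11.hook_resid_post",
    n_features=131_072,
    dtype="float32",
    architecture="standard",
    context_size=1_024,
    dataset="monology/pile-uncopyrighted",
)

print(config.layer())                # layer index parsed from the hook name
print(config.n_features == 2 ** 17)  # 131,072 is a power of two
```

The layer parsed from the hook name (11) agrees with the `resid_post_layer_11` segment of the SAE identifier, which is a quick consistency check on the listing.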

Negative Logits

�

-0.06

	results

-0.06

_SHADOW

-0.06

digits

-0.06

бач

-0.06

abit

-0.06

 групп

-0.06

collection

-0.06

печ

-0.06

 COMM

-0.06

Positive Logits

 absentee

0.07

истра

0.07

드립니다

0.07

 noticing

0.07

 "-"↵

0.07

.ViewHolder

0.06

�

0.06

 unidentified

0.06

 unmist

0.06

 термін

0.06

Activations Density 0.009%
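
To give a sense of scale for the density figure, here is a rough sketch that assumes density means the fraction of token positions at which the feature's activation is above zero. That definition is an assumption, not something stated on this page, and the function name `activation_density` and the sample data are invented for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions where the activation exceeds the threshold.

    Assumed definition; the dashboard may compute density differently.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Synthetic data: 9 active positions out of 100,000 tokens.
sample = [0.0] * 99_991 + [1.5] * 9
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.009%
```

Under this reading, a density of 0.009% corresponds to the feature firing on roughly 9 of every 100,000 tokens.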

No Known Activations