Explanations

Numbers and currency

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various

Features: 131,072

Data Type: float32

Hook Name: blocks.11.hook_resid_post

Architecture: standard

Context Size: 1,024

Dataset: monology/pile-uncopyrighted

Negative Logits

" placement"    -0.08
"�建"           -0.08
"Dry"           -0.07
"lings"         -0.07
".nz"           -0.07
"/document"     -0.07
"_singleton"    -0.07
" هزار"         -0.07
" derec"        -0.07
" double"       -0.07

Positive Logits

"Affected"      0.07
" ICollection"  0.06
"//================================================================"  0.06
"brıs"          0.06
" obscured"     0.06
" dele"         0.06
"าธ"            0.06
"rhs"           0.06
" Barack"       0.06
" Experts"      0.06

Activations Density 0.141%

No Known Activations