INDEX

Explanations

2

np_max-act · gemini-2.0-flash

Configuration: andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard): Various

Features: 131,072

Data Type: float32

Hook Name: blocks.7.hook_resid_post

Architecture: standard

Context Size: 1,024

Dataset: monology/pile-uncopyrighted

Negative Logits

.temperature

-0.07

нич

-0.06

wechat

-0.06

 component

-0.06

 ciclo

-0.06

BR

-0.06

 France

-0.06

 nationals

-0.06

.blogspot

-0.06

sla

-0.06

Positive Logits

timestamps

0.08

 swagger

0.07

 Environmental

0.06

 orang

0.06

aison

0.06

 ambiance

0.06

५

0.06

}")]↵

0.06

xf

0.06

.DeepEqual

0.06

Activations Density 0.033%
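
As a rough illustration of what a density figure like this could mean, the sketch below assumes density is the fraction of sampled token positions on which the feature's activation is greater than zero. The function name `activation_density`, the threshold parameter, and the sample values are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 3 of which activate the feature.
sample = [0.0] * 9997 + [0.4, 1.2, 0.05]
density = activation_density(sample)

# Format as a percentage, the same way the dashboard reports the value.
print(f"{density:.3%}")
```

Under this assumption, a density of 0.033% would correspond to roughly one active position in every 3,000 tokens.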

2

No Known Activations
