INDEX

Explanations

colon

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Negative Logits

 aerobic

-0.06

Trust

-0.06

google

-0.06

 geen

-0.06

 rave

-0.06

-health

-0.06

Alarm

-0.06

푸

-0.06

 если

-0.05

href

-0.05

Positive Logits

(mapped

0.07

十三

0.07

.clientY

0.07

 Canadiens

0.06

jekt

0.06

 поля

0.06

 harvesting

0.06

 useEffect

0.06

(itemId

0.06

.setImage

0.06
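The two logit lists above are commonly produced by projecting a feature's decoder direction through the model's unembedding matrix and ranking vocabulary tokens by the resulting score. The page does not state its method, so the following is only a minimal sketch under that assumption. The function name `top_logit_tokens`, the toy matrices, and the toy vocabulary are all hypothetical and are not taken from this dashboard.

```python
def top_logit_tokens(decoder_dir, unembed, vocab, k=3):
    """Rank tokens by the logit effect of one feature direction.

    Assumption: effect[j] = sum_i decoder_dir[i] * unembed[i][j].
    Returns (most negative k tokens, most positive k tokens).
    """
    effects = [
        sum(d * row[j] for d, row in zip(decoder_dir, unembed))
        for j in range(len(vocab))
    ]
    # Sort ascending so the most negative effects come first.
    ranked = sorted(zip(vocab, effects), key=lambda pair: pair[1])
    negative = ranked[:k]
    positive = ranked[::-1][:k]
    return negative, positive


# Toy values for illustration only (2-dimensional residual, 4-token vocabulary).
decoder_dir = [1.0, -1.0]
unembed = [
    [0.5, -0.25, 0.0, 0.75],
    [0.25, 0.25, -0.5, 0.0],
]
vocab = ["a", "b", "c", "d"]

negative, positive = top_logit_tokens(decoder_dir, unembed, vocab, k=2)
print("negative:", negative)
print("positive:", positive)
```

Under this reading, values as small as the ±0.06 listed here would suggest that the feature has only a weak direct effect on any single output token.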

Activations Density 0.037%
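Activation density is usually read as the fraction of sampled tokens on which the feature fires at all. A minimal sketch of that calculation follows, assuming "fires" means a strictly positive activation. The sample below is made up to reproduce the 0.037% figure and is not the dashboard's data.

```python
def activation_density(activations):
    """Fraction of token positions with a strictly positive activation."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > 0)
    return active / len(activations)


# Hypothetical sample: 37 active positions out of 100,000 tokens.
sample = [0.0] * 99_963 + [1.0] * 37
density = activation_density(sample)
print(f"{density:.3%}")
```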

colon

No Known Activations
