INDEX

Explanations

integrated

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 besides

-0.06

 обо

-0.06

 Schwarz

-0.06

.lesson

-0.06

 Notice

-0.06

 Province

-0.06

 pepper

-0.06

(ship

-0.06

 upon

-0.06

psi

-0.06

POSITIVE LOGITS

 integrated

0.12

 Integrated

0.10

 integrate

0.10

 integration

0.09

Integrated

0.08

 integrates

0.08

 integral

0.08

 Integration

0.08

quate

0.08

egrated

0.07

Activations Density 0.020%

integrated

No Comments

No Known Activations