INDEX

Explanations

north

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

.normal

-0.07

 classified

-0.06

 propri

-0.06

_requested

-0.06

Visual

-0.06

 mini

-0.06

 contractual

-0.06

 Unknown

-0.06

 Prop

-0.06

_handle

-0.06

POSITIVE LOGITS

 East

0.12

East

0.10

 West

0.10

 east

0.09

 north

0.09

 Southern

0.08

 EAST

0.08

West

0.08

South

0.08

 North

0.08

Activations Density 0.024%

north

No Comments

No Known Activations

north

No Comments

No Known Activations