INDEX

Explanations

groups of two or three

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

.OPEN

-0.07

>[↵

-0.07

ב

-0.06

.functions

-0.06

 enctype

-0.06

_j

-0.06

Arguments

-0.06

Mountain

-0.06

 культур

-0.06

 badly

-0.06

POSITIVE LOGITS

duo

0.15

 trio

0.14

Duo

0.11

 threesome

0.09

 Trio

0.09

uo

0.08

 Quart

0.08

 quart

0.08

 pair

0.08

0.07

Activations Density 0.006%

groups of two or three

No Comments

No Known Activations