INDEX

Explanations

language descriptions

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

زية

-0.07

상담

-0.06

 ipad

-0.06

_account

-0.06

,g

-0.06

_FUNCTION

-0.06

Accordion

-0.06

८

-0.06

งศ

-0.06

ิเศษ

-0.06

POSITIVE LOGITS

uly

0.07

 그녀

0.07

LEE

0.06

 Düş

0.06

inn

0.06

Ter

0.06

lee

0.06

(run

0.06

 Auss

0.06

.Sort

0.06

Activations Density 0.027%

language descriptions

No Comments

No Known Activations

language descriptions

No Comments

No Known Activations