INDEX

Explanations

say "comma"

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 radioactive

-0.07

.InputStreamReader

-0.07

仅供

-0.07

 devote

-0.07

Sto

-0.06

 datingsider

-0.06

rf

-0.06

 yeni

-0.06

aving

-0.06

 Narrow

-0.06

POSITIVE LOGITS

 ellas

0.07

RR

0.07

슝

0.07

 verdict

0.07

_INSERT

0.07

_limit

0.07

XXXX

0.07

 withObject

0.07

_repeat

0.07

_payment

0.07

Activations Density 0.130%

say "comma"

No Comments

No Known Activations

say "comma"

No Comments

No Known Activations