Explanations

computer technology acronyms

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.15.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

"etak"  -0.07
" Teresa"  -0.07
" Pain"  -0.07
"պ"  -0.06
"aina"  -0.06
".factor"  -0.06
"reiben"  -0.06
"_pi"  -0.06
"贝"  -0.06
"_pin"  -0.06

Positive Logits

" userList"  0.07
"vel"  0.07
"回调"  0.07
"(docs"  0.07
"um"  0.06
" instanceof"  0.06
" */,↵"  0.06
" provoke"  0.06
" curled"  0.06
"postId"  0.06

Activations Density 0.145%
