INDEX

Explanations

brackets and at signs

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Satoshi

-0.07

宛如

-0.07

 Rosa

-0.06

olk

-0.06

ResourceId

-0.06

就是为了

-0.06

俄罗斯

-0.06

 […]

-0.06

submitButton

-0.06

.Process

-0.06

POSITIVE LOGITS

התייחס

0.07

anges

0.07

Quantity

0.07

бил

0.07

uant

0.07

//****************************************************************

0.07

 ################################################

0.07

beeld

0.06

_ud

0.06

 Cable

0.06

Activations Density 0.002%

brackets and at signs

No Comments

No Known Activations