Explanations

inclusion (math context)

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

" Shel": -0.07
" posts": -0.07
" inches": -0.06
"日前": -0.06
"ändig": -0.06
"atisch": -0.06
"綠": -0.06
" fiber": -0.06
"可愛": -0.06
"雨": -0.06

Positive Logits

"structuring": 0.08
"_Select": 0.07
"톺": 0.07
" sacrificing": 0.07
"RYPTO": 0.07
"ᴉ": 0.07
".const": 0.07
" appropriated": 0.07
"_fold": 0.07
" Stand": 0.07

Activations Density 0.012%
