INDEX

Explanations

#pragma

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 beautiful

-0.07

FormatException

-0.07

 الإلكترو

-0.07

二十四

-0.07

 آلاف

-0.07

 transl

-0.07

 Fairy

-0.07

.apply

-0.06

 lombok

-0.06

 najbliż

-0.06

POSITIVE LOGITS

ToUpdate

0.06

发动机

0.06

的脸

0.06

Atlantic

0.06

 Scalia

0.06

_air

0.06

肛

0.06

Allen

0.06

嵊

0.06

OfBirth

0.06

Activations Density 0.014%

#pragma

No Comments

No Known Activations

#pragma

No Comments

No Known Activations