INDEX

Explanations

Scientific publications/addresses

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

_parser

-0.08

_nom

-0.07

货币政策

-0.07

ANCH

-0.07

إنش

-0.07

ponge

-0.07

AP

-0.07

composer

-0.07

.getZ

-0.07

nuts

-0.07

POSITIVE LOGITS

 בארה

0.08

ياة

0.08

ادة

0.07

تأ

0.07

larında

0.07

allel

0.07

Eyl

0.07

aled

0.07

 słab

0.07

クリニック

0.07

Activations Density 0.012%

Scientific publications/addresses

No Comments

No Known Activations