INDEX

Explanations

code (np_max-act · gemini-2.0-flash)

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.15.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted
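
The configuration fields above can be collected into a single record for scripting. The sketch below is only an illustration: the `SaeConfig` class and its field names are hypothetical and not part of any library, and the `hook_name` property assumes the `blocks.{layer}.hook_resid_post` naming pattern seen in the Hook Name value.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SaeConfig:
    """Hypothetical container for the dashboard's configuration fields."""

    release: str
    layer: int
    n_features: int
    dtype: str
    architecture: str
    context_size: int
    dataset: str

    @property
    def hook_name(self) -> str:
        # Assumed naming pattern, matching the Hook Name shown on the page.
        return f"blocks.{self.layer}.hook_resid_post"


config = SaeConfig(
    release="andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1",
    layer=15,
    n_features=131_072,
    dtype="float32",
    architecture="standard",
    context_size=1_024,
    dataset="monology/pile-uncopyrighted",
)

print(config.hook_name)
```

Keeping the layer as an integer and deriving the hook name from it avoids the two values drifting apart when the record is copied for another layer.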

Negative Logits

" República"  -0.08
"国民党"  -0.08
"ʤ"  -0.07
"_imgs"  -0.07
"淡"  -0.07
"зн"  -0.07
"ễn"  -0.06
" Québec"  -0.06
"刻意"  -0.06
"狸"  -0.06

Positive Logits

" Raum"  0.07
"Mid"  0.07
" ()↵"  0.07
"////////////////////////////////////////////////////////////////////////////////↵"  0.07
"?)↵"  0.07
"']))↵"  0.07
"()>↵"  0.07
"_checkbox"  0.07
" التداول"  0.07
");↵↵"  0.07

Activations Density 0.136%
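
To put the density figure in context: assuming it means the fraction of sampled tokens on which the feature has a nonzero activation (the page does not define it), 0.136% corresponds to a bit over one active token per 1,024-token context on average. A small sketch of that arithmetic, with hypothetical helper names:

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds the threshold."""
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


def expected_active_tokens(density, context_size):
    """Average number of active tokens in a context of the given length."""
    return density * context_size


# Reported density (0.136%) and context size (1,024) from the page.
density_reported = 0.136 / 100
per_context = expected_active_tokens(density_reported, 1_024)
print(round(per_context, 2))  # about 1.39

# Toy check of the density definition on made-up activations.
toy_density = activation_density([0.0, 0.0, 0.5, 0.0])
```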

No Known Activations
