INDEX

Explanations

architecture and design

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 consequ

-0.07

amin

-0.07

anvas

-0.06

My

-0.06

 Shakespeare

-0.06

ase

-0.06

 exemption

-0.06

等到

-0.06

 Derneği

-0.06

괗

-0.06

POSITIVE LOGITS

*f

0.07

阖

0.07

hm

0.07

创业者

0.07

륙

0.06

startDate

0.06

思想

0.06

휀

0.06

 XCTest

0.06

学子

0.06

Activations Density 0.057%

architecture and design

No Comments

No Known Activations

architecture and design

No Comments

No Known Activations