Explanations

code and technical documentation (np_max-act · gemini-2.0-flash)

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.15.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

estate  -0.07
itez  -0.07
бря  -0.07
 broadcasting  -0.07
强奸  -0.07
(regex  -0.07
ató  -0.07
_create  -0.07
ств  -0.07
нстру  -0.06

Positive Logits

ERTICAL  0.07
AD  0.07
MAV  0.07
Av  0.07
海淀区  0.07
AL  0.06
蔽  0.06
 pulls  0.06
.setLayout  0.06
 Forced  0.06

Activations Density: 0.038%
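
To put the density figure in perspective, the short sketch below converts it into more intuitive quantities. It assumes that the density is the fraction of token positions on which the feature has a nonzero activation, and that this rate holds uniformly across contexts; the dashboard does not state either point. The function name `density_summary` and its dictionary keys are hypothetical, chosen only for this illustration.

```python
def density_summary(density_percent: float, context_size: int) -> dict:
    """Convert an activation density (in percent) into per-token and per-context figures.

    Assumption: density is the fraction of token positions where the
    feature activation is nonzero, applied uniformly across contexts.
    """
    fraction = density_percent / 100.0
    return {
        # Share of token positions on which the feature fires.
        "fraction": fraction,
        # Average number of tokens between activations.
        "tokens_per_activation": 1.0 / fraction,
        # Expected number of activations in one context window.
        "expected_per_context": fraction * context_size,
    }


# Values taken from this page: density 0.038%, context size 1,024.
summary = density_summary(0.038, 1024)
for name, value in summary.items():
    print(f"{name}: {value:.5f}")
```

Under these assumptions the feature fires roughly once every 2,600 tokens, which is fewer than one expected activation per 1,024-token context.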

No Known Activations