INDEX

Explanations

requests

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 getchar

-0.07

千方

-0.07

 ontvangst

-0.07

 strategically

-0.07

.GetChild

-0.07

 глав

-0.07

变更

-0.07

 đóng

-0.07

福田

-0.06

煓

-0.06

POSITIVE LOGITS

noticed

0.08

랄

0.08

 рекл

0.07

_modified

0.07

 ################################################################

0.07

Analy

0.07

 noticeable

0.07

🚲

0.07

lève

0.07

("\"

0.07

Activations Density 0.005%

requests

No Comments

No Known Activations