Explanations

Lincoln

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.15.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

 ============================================================================↵

-0.08

 заб

-0.08

ROS

-0.07

 donate

-0.07

시간

-0.07

 CERT

-0.07

)||(

-0.07

Ϻ

-0.07

 cuda

-0.07

out

-0.07

Positive Logits

 Lincoln

0.08

Лени

0.07

]+'

0.07

 İşte

0.07

洗脸

0.07

もちろん

0.07

Al

0.07

 tenía

0.07

.workflow

0.07

淮

0.07

Activations Density 0.005%

Lincoln

No Known Activations
