INDEX

Explanations

Engineering and physics contexts

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 addSubview

-0.07

 babes

-0.07

เผ

-0.07

Cnt

-0.07

 mindful

-0.07

 Pharm

-0.07

 Thị

-0.07

 عدة

-0.07

的人生

-0.07

Subview

-0.07

POSITIVE LOGITS

ﴘ

0.07

缩减

0.06

□

0.06

ALLED

0.06

واس

0.06

性价比

0.06

 الإنسان

0.06

𝚏

0.06

eners

0.06

 russian

0.06

Activations Density 0.012%

Engineering and physics contexts

No Comments

No Known Activations