INDEX

Explanations

forms requesting names

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Sustainable

-0.07

Dis

-0.06

 oldu

-0.06

Heb

-0.06

消失

-0.06

drm

-0.06

장애

-0.06

망

-0.06

Invoke

-0.06

祝

-0.06

POSITIVE LOGITS

 płyn

0.07

掖

0.07

皦

0.07

 bund

0.07

צפייה

0.07

最美的

0.07

暖气

0.07

侹

0.07

 jars

0.07

ória

0.06

Activations Density 0.091%

forms requesting names

No Comments

No Known Activations

forms requesting names

No Comments

No Known Activations