INDEX

Explanations

Institute, Museum

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 latch

-0.07

Sortable

-0.07

新

-0.06

 Rosie

-0.06

January

-0.06

 Amendments

-0.06

 +#+#+#+

-0.06

Activation

-0.06

-gap

-0.06

Fight

-0.06

POSITIVE LOGITS

_paths

0.07

 Bound

0.07

Bil

0.07

 wichtig

0.07

 photographs

0.07

 Atlantis

0.07

网站地图

0.07

(indices

0.07

HTTPS

0.07

右侧

0.07

Activations Density 0.036%

Institute, Museum

No Comments

No Known Activations