INDEX

Explanations

optical and material properties

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

xec

-0.07

ühr

-0.07

 quizá

-0.07

浴

-0.07

uguay

-0.07

 כניסה

-0.07

 marzo

-0.07

 continues

-0.07

 proprio

-0.07

 הקודם

-0.07

POSITIVE LOGITS

_FIRE

0.07

,model

0.07

}";↵

0.07

 volcan

0.06

 buckets

0.06

 comforting

0.06

欣慰

0.06

_LOOK

0.06

_known

0.06

,"↵

0.06

Activations Density 0.038%

optical and material properties

No Comments

No Known Activations

optical and material properties

No Comments

No Known Activations