INDEX

Explanations

HTML display properties

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ils

-0.07

܈

-0.07

aryl

-0.07

وصل

-0.07

 בהם

-0.07

 حق

-0.07

Jones

-0.07

oblins

-0.07

utes

-0.06

POSITIVE LOGITS

特色的

0.07

단

0.07

Tab

0.07

擿

0.07

?(

0.07

','=',$

0.07

strstr

0.07

 Recap

0.06

BX

0.06

단

0.06

Activations Density 0.000%

HTML display properties

No Comments

No Known Activations

HTML display properties

No Comments

No Known Activations