INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_23/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.23.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

慷

-0.08

dıktan

-0.07

 cocaine

-0.07

 onKeyDown

-0.07

gs

-0.07

ἥ

-0.07

接受了

-0.07

celona

-0.07

Gn

-0.06

csr

-0.06

POSITIVE LOGITS

 block

0.07

.imgur

0.07

study

0.07

 ours

0.06

Reviewed

0.06

 form

0.06

 scripting

0.06

.azure

0.06

____

0.06

 الأساسية

0.06

Activations Density 0.070%

No Comments

No Known Activations