INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

梌

-0.08

幼儿园

-0.07

 Datum

-0.07

�

-0.07

idUser

-0.07

 murderer

-0.07

qualities

-0.07

 réalisé

-0.07

 brutally

-0.07

	unit

-0.07

POSITIVE LOGITS

.preventDefault

0.08

ERSIST

0.08

VA

0.07

.BOTTOM

0.07

مس

0.07

เสมอ

0.07

-dismiss

0.07

urn

0.07

的增长

0.07

AE

0.07

Activations Density 0.075%

No Comments

No Known Activations

No Comments

No Known Activations