Explanations

No Explanations Found

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_23/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.23.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted
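
For readers who want to handle this configuration programmatically, the values listed above could be captured as in the minimal sketch below. The `SaeConfig` class, its field names, and the `parse_count` helper are hypothetical names introduced here for illustration; they are not part of any library or of the dashboard itself. Only the values are taken from the listing.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SaeConfig:
    """Hypothetical container: field names are illustrative, values come from the listing."""
    sae_id: str
    dashboard_dataset: str
    num_features: int
    dtype: str
    hook_name: str
    architecture: str
    context_size: int
    dataset: str


def parse_count(text: str) -> int:
    """Convert a display string such as '131,072' into an int."""
    return int(text.replace(",", ""))


config = SaeConfig(
    sae_id="andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_23/trainer_1",
    dashboard_dataset="Various",
    num_features=parse_count("131,072"),
    dtype="float32",
    hook_name="blocks.23.hook_resid_post",
    architecture="standard",
    context_size=parse_count("1,024"),
    dataset="monology/pile-uncopyrighted",
)

# Assumption: the hook name follows the pattern "blocks.<layer>.hook_resid_post",
# so the layer index is the second dot-separated component.
layer = int(config.hook_name.split(".")[1])
print(config.num_features, config.context_size, layer)
```

The layer index recovered from the hook name agrees with the `resid_post_layer_23` component of the configuration path.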

Negative Logits

" Laptop"  -0.07
"sprintf"  -0.07
"并不代表"  -0.07
"	holder"  -0.07
" pork"  -0.07
"脑袋"  -0.06
" Dodgers"  -0.06
" evapor"  -0.06
"Derived"  -0.06
" Alien"  -0.06

Positive Logits

" adress"  0.07
"减免"  0.07
"CE"  0.07
".users"  0.07
"ܓ"  0.07
" adresse"  0.07
"ucing"  0.07
"毒性"  0.07
"oulouse"  0.07
"elite"  0.07

Activations Density 0.000%

No Known Activations