INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.15.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Not in Any Lists

No Comments

Negative Logits

�  -0.07
	free  -0.07
("/  -0.07
	scene  -0.07
 routes  -0.07
>List  -0.07
()],↵  -0.06
 centres  -0.06
blick  -0.06
 bulk  -0.06

Positive Logits

🔑  0.07
 Shaw  0.07
ἄ  0.07
 setters  0.07
ewear  0.07
 tweaked  0.06
ifter  0.06
 bilingual  0.06
找个  0.06
 Taiwanese  0.06
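
The page does not say how the logit lists are computed. A common convention for dashboards of this kind, assumed here, is to project the feature's decoder direction through the model's unembedding matrix and rank tokens by the resulting score. The sketch below illustrates that idea on toy numbers; the function name `feature_logit_effects`, the two-dimensional vectors, and the three-token vocabulary are all hypothetical and are not taken from the model or SAE listed above.

```python
def feature_logit_effects(decoder_dir, unembed_rows, vocab):
    """Rank tokens by the dot product of a feature's decoder direction
    with each token's unembedding vector (assumed convention)."""
    scores = []
    for token, row in zip(vocab, unembed_rows):
        score = sum(d * w for d, w in zip(decoder_dir, row))
        scores.append((token, score))
    # Highest score first: head of the list = positive logits,
    # tail of the list = negative logits.
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy values, chosen only to make the ranking easy to follow.
decoder_dir = [1.0, -1.0]
vocab = ["a", "b", "c"]
unembed_rows = [[0.5, 0.0], [0.0, 0.5], [0.25, 0.25]]

ranked = feature_logit_effects(decoder_dir, unembed_rows, vocab)
for token, score in ranked:
    print(repr(token), round(score, 2))
```

With magnitudes as small as the ±0.07 shown above, the ranking is probably best read as a weak hint about the feature rather than a strong signal.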

Activations Density 0.030%
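
To put the density figure in perspective: assuming it is the fraction of sampled token positions on which the feature has a nonzero activation (the usual meaning, though the page does not define it), 0.030% corresponds to roughly 3 activations per 10,000 tokens. A minimal sketch of that calculation, using a hypothetical helper `activation_density` and made-up activation values:

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds the threshold."""
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical toy data: 10,000 token positions, 3 of which activate.
toy = [0.0] * 10_000
for i in (12, 4_500, 9_999):
    toy[i] = 1.7

density = activation_density(toy)
print(f"{density:.3%}")  # prints 0.030%
```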

No Known Activations