INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

命中

-0.07

XYZ

-0.06

 driving

-0.06

INavigationController

-0.06

뻔

-0.06

 pacman

-0.06

�

-0.06

BSP

-0.06

 Marxism

-0.06

大幅提升

-0.06

POSITIVE LOGITS

']]

0.07

 Thumb

0.07

კ

0.07

 madre

0.07

 Printing

0.07

 twig

0.07

ọ

0.07

 ağrı

0.07

ccion

0.07

()?;↵

0.07

Activations Density 0.014%

No Comments

No Known Activations

No Comments

No Known Activations