INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Knight

-0.08

 Ecuador

-0.07

 Event

-0.07

 Ronald

-0.07

 adaptive

-0.07

 waterfall

-0.07

_domain

-0.07

 airline

-0.07

 sworn

-0.07

顶级

-0.07

POSITIVE LOGITS

isa

0.08

我在

0.07

蓄

0.06

 bless

0.06

 מבוס

0.06

"{

0.06

YM

0.06

js

0.06

saw

0.06

 Jess

0.06

Activations Density 0.001%

No Comments

No Known Activations