Explanations

No Explanations Found

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.19.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

其中一个  -0.07
 suspected  -0.07
 aslı  -0.07
风景  -0.07
 thousands  -0.06
 forn  -0.06
自助  -0.06
whatever  -0.06
Sorry  -0.06
讓人  -0.06

Positive Logits

 mạng  0.07
にして  0.07
.string  0.07
граф  0.07
>T  0.07
 ComboBox  0.07
":@"  0.07
_IA  0.07
 slicing  0.07
شاش  0.07

Activations Density 0.084%
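
For a sense of scale, the density figure can be converted into an expected number of active tokens per context window. This is only a rough sketch: it assumes that "density" means the fraction of sampled tokens on which the feature fires and that firings are spread evenly across contexts, neither of which the dashboard states. The function name `expected_active_tokens` is hypothetical.

```python
def expected_active_tokens(density_percent: float, context_size: int) -> float:
    """Expected tokens per context on which the feature is active.

    Assumption: density is a per-token firing rate, expressed in percent.
    """
    return density_percent / 100.0 * context_size


# Values taken from the dashboard: 0.084% density, context size 1,024.
per_context = expected_active_tokens(0.084, 1024)
print(f"{per_context:.2f}")  # prints 0.86
```

Under these assumptions the feature would fire on fewer than one token per 1,024-token context on average, which is consistent with a sparse feature.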

No Known Activations