Explanations

News and opinions articles

np_max-act-logits · gemini-2.0-flash

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.19.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

" gums"      -0.07
"archivo"    -0.07
"(Xml"       -0.07
" clause"    -0.07
"vim"        -0.07
" pang"      -0.07
" cantidad"  -0.07
" формы"     -0.07
"/plugins"   -0.07
"pap"        -0.07

Positive Logits

"*_"             0.07
".Zero"          0.07
"잭"             0.07
"SHOP"           0.07
"结局"           0.07
" recipro"       0.07
".insertBefore"  0.07
" Feder"         0.07
"onDelete"       0.07
"创客"           0.07

Activations Density 0.120%
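
The density figure above is not defined on this page. A minimal sketch, assuming it means the fraction of sampled token positions on which the feature has a nonzero activation; the function name, the threshold parameter, and the sample data are all hypothetical and chosen only to reproduce a value of 0.120%:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means (active positions) / (all positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 12 of which activate the feature.
sample = [0.0] * 9988 + [1.5] * 12
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.120% corresponds to roughly 12 active positions per 10,000 tokens.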

No Known Activations