Explanations

photo

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.15.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

Token            Logit
 Ethics          -0.08
-operation       -0.07
itung            -0.07
心中             -0.07
ARB              -0.07
三百             -0.07
notification     -0.07
免责             -0.07
 spoken          -0.07
 andra           -0.07

Positive Logits

Token            Logit
CreateDate       0.08
kad              0.07
скоп             0.06
.setHorizontal   0.06
 zdję            0.06
 görm            0.06
 Ukrain          0.06
Hierarchy        0.06
界的             0.06
 parenting       0.06

Activations Density 0.047%

photo

No Known Activations
