INDEX

Explanations

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Sentinel

-0.08

개

-0.07

difference

-0.07

-key

-0.06

December

-0.06

TestData

-0.06

 Atmospheric

-0.06

.Display

-0.06

 skirm

-0.06

𝖓

-0.06

POSITIVE LOGITS

hum

0.08

.PackageManager

0.07

💓

0.07

HL

0.07

مستقبل

0.07

谈谈

0.07

 decryption

0.07

ADD

0.07

 tunes

0.07

 agreeing

0.07

Activations Density 0.006%

more

No Comments

No Known Activations

more

No Comments

No Known Activations