INDEX

Explanations

Dates, coding, and inclusion

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 hvor

-0.07

 portrays

-0.06

一枚

-0.06

 song

-0.06

ں

-0.06

kle

-0.06

丞

-0.06

Elf

-0.06

Ж

-0.06

 melhores

-0.06

POSITIVE LOGITS

 debería

0.07

一致性

0.07

愃

0.06

lobals

0.06

final

0.06

odynamics

0.06

.Platform

0.06

_Module

0.06

 cụ

0.06

 finishes

0.06

Activations Density 0.049%

Dates, coding, and inclusion

No Comments

No Known Activations

Dates, coding, and inclusion

No Comments

No Known Activations