INDEX

Explanations

numbers and comparisons

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

品位

-0.08

.setTo

-0.07

迎接

-0.07

附加值

-0.07

AJ

-0.07

化进程

-0.07

 категории

-0.07

 visualize

-0.07

をご

-0.07

愍

-0.07

POSITIVE LOGITS

weather

0.07

 instantaneous

0.07

 Drum

0.07

 banking

0.07

WARNING

0.07

崒

0.07

좡

0.07

记得

0.07

ści

0.06

 barley

0.06

Activations Density 0.007%

numbers and comparisons

No Comments

No Known Activations

numbers and comparisons

No Comments

No Known Activations