INDEX

Explanations

colons and underscores

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

embedded

-0.08

(conf

-0.07

Monkey

-0.07

_comp

-0.07

 significance

-0.07

 unborn

-0.07

HW

-0.07

𝐋

-0.07

 flashlight

-0.07

ני

-0.07

POSITIVE LOGITS

经营理念

0.08

澎湃

0.07

铆

0.07

倕

0.07

ทะเล

0.06

这就

0.06

ptides

0.06

oracle

0.06

╭

0.06

gran

0.06

Activations Density 0.094%

colons and underscores

No Comments

No Known Activations

colons and underscores

No Comments

No Known Activations