INDEX

Explanations

scientific/technical language

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

shot

-0.07

tif

-0.07

DIY

-0.07

נוס

-0.07

쏜

-0.07

俪

-0.07

ündig

-0.07

时辰

-0.06

ucid

-0.06

folk

-0.06

POSITIVE LOGITS

方向盘

0.08

批发市场

0.08

	handler

0.08

 Park

0.08

中断

0.08

↵

0.08

_ini

0.08

רקע

0.07

 Discord

0.07

 horses

0.07

Activations Density 0.506%

scientific/technical language

No Comments

No Known Activations

scientific/technical language

No Comments

No Known Activations