INDEX

Explanations

instructions

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

mwhanna/qwen3-4b-transcoders/layer_11.safetensors

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

163,840

Data Type

float32

Hook Name

blocks.11.mlp.hook_in

Architecture

transcoder

Context Size

8,192

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

åıįèĢĮ

-0.28

Ð¸ÑĩÐµÑģÐº

-0.27

dre

-0.27

ego

-0.26

 Republic

-0.26

å©Ĭ

-0.25

 nond

-0.25

à¸¶à¸ģ

-0.25

 morning

-0.25

 didSet

-0.25

POSITIVE LOGITS

azole

0.30

Ð»ÐµÐ·

0.27

 verd

0.25

Atlas

0.25

oce

0.25

æŁľ

0.24

 Greenwood

0.24

è¹Ħ

0.24

WithOptions

0.24

ä¸ŃéĢĶ

0.23

Activations Density 0.034%

instructions

No Comments

No Known Activations