INDEX

Explanations

,

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

mwhanna/qwen3-4b-transcoders/layer_23.safetensors

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

163,840

Data Type

float32

Hook Name

blocks.23.mlp.hook_in

Architecture

transcoder

Context Size

8,192

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ä»¥åıĬ

-0.42

æĸĩåĮĸåĴĮ

-0.40

ç®¡çĲĨåĴĮ

-0.38

åıĬ

-0.37

è´¨éĩıåĴĮ

-0.37

æĪĸæĺ¯

-0.36

æĪĸèĢħ

-0.36

ä»¥åıĬåħ¶ä»ĸ

-0.36

 Ð¸Ð»Ð¸

-0.34

ãģĬãĤĪãģ³

-0.34

POSITIVE LOGITS

 Mime

0.29

 third

0.29

vation

0.28

çļĦæľĢåĲİä¸Ģ

0.27

 ;↵↵↵

0.26

zej

0.26

 hyper

0.25

 Third

0.25

venth

0.24

aves

0.24

Activations Density 0.038%

,

No Comments

No Known Activations