INDEX

Explanations

ethn

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

mwhanna/qwen3-4b-transcoders/layer_11.safetensors

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

163,840

Data Type

float32

Hook Name

blocks.11.mlp.hook_in

Architecture

transcoder

Context Size

8,192

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

priv

-0.31

icode

-0.28

PCR

-0.25

Priv

-0.25

åħīæĺİ

-0.25

,width

-0.25

-hot

-0.24

/link

-0.24

 priv

-0.24

loid

-0.23

POSITIVE LOGITS

issen

0.27

illes

0.27

Cnt

0.27

_cnt

0.27

ahr

0.26

æĦıè¯Ĩ

0.26

yps

0.26

asca

0.26

å¾Ĵ

0.25

å°ĳè§ģ

0.25

Activations Density 0.336%

ethn

No Comments

No Known Activations