Explanations

then

np_max-act · gemini-2.0-flash

Configuration

mwhanna/qwen3-4b-transcoders/layer_23.safetensors

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

163,840

Data Type

float32

Hook Name

blocks.23.mlp.hook_in

Architecture

transcoder

Context Size

8,192

Dataset

monology/pile-uncopyrighted

Negative Logits

彻

-0.29

theast

-0.28

aten

-0.28

ein

-0.26

#↵↵

-0.26

_append

-0.25

Web

-0.24

ĮĢ

-0.24

combe

-0.24

ueue

-0.24

Positive Logits

 individual

0.38

 apparently

0.30

 despite

0.30

 none

0.29

尽管

0.28

 although

0.28

individual

0.28

 there

0.28

 even

0.27

/how

0.27

Activations Density 0.013%
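
To put the density figure in concrete terms, the short calculation below converts it to an approximate token count. It assumes the density is the fraction of dashboard tokens (16,384 prompts of 128 tokens each) on which the feature is active; the page does not say how the figure is computed, so treat the result as a rough estimate.

```python
# Values copied from the dashboard fields above.
N_PROMPTS = 16_384
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.013

# Assumption: density = active tokens / total dashboard tokens.
total_tokens = N_PROMPTS * TOKENS_PER_PROMPT
expected_active = total_tokens * DENSITY_PERCENT / 100

print(f"total dashboard tokens: {total_tokens:,}")
print(f"estimated active tokens: {expected_active:.0f}")
```

Under that assumption the feature would fire on a few hundred of roughly two million tokens.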

No Known Activations