Explanations

kind/sort

np_max-act · gemini-2.0-flash

Configuration

mwhanna/qwen3-4b-transcoders/layer_11.safetensors

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

163,840

Data Type

float32

Hook Name

blocks.11.mlp.hook_in

Architecture

transcoder

Context Size

8,192

Dataset

monology/pile-uncopyrighted

Negative Logits

?.

-0.25

rink

-0.25

城

-0.25

有一次

-0.25

rown

-0.25

ảy

-0.25

ظام

-0.24

??

-0.24

混

-0.24

utan

-0.24

Positive Logits

esModule

0.28

数据中心

0.27

.extent

0.27

CUDA

0.27

 spilled

0.25

 Earth

0.25

Earth

0.25

 sidel

0.24

.getCount

0.23

地球

0.23
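
Byte-level BPE vocabularies usually store non-ASCII tokens in a remapped form, so a token such as 地球 can show up in a logit list as `åľ°çĲĥ`. The sketch below converts such raw token strings back to readable text. It assumes the tokenizer uses the GPT-2 style byte-to-unicode table, which this page does not confirm. The helper names `bytes_to_unicode` and `decode_token` are illustrative, not part of any dashboard API.

```python
def bytes_to_unicode():
    """Build the GPT-2 style byte -> printable-character table (assumed here)."""
    # Bytes that are already printable map to themselves.
    printable = list(range(33, 127)) + list(range(161, 173)) + list(range(174, 256))
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    for b in range(256):
        if b not in printable:
            # Remaining bytes are shifted to code points starting at 256.
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return dict(zip(byte_values, map(chr, code_points)))


UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Decode a raw byte-level token string; return it unchanged if it is not in raw form."""
    if any(ch not in UNICODE_TO_BYTE for ch in token):
        # Already readable text (for example, contains a literal space).
        return token
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    # Partial tokens may not be valid UTF-8 on their own.
    return raw.decode("utf-8", errors="replace")


print(decode_token("åľ°çĲĥ"))
print(decode_token("æķ°æį®ä¸Ńå¿ĥ"))
```

Read this way, the positive logits include " Earth", "Earth", and 地球, which is the same word in Chinese.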

Activations Density 0.005%
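
As a rough sanity check on the density figure, the sketch below estimates how many token positions would activate this feature across the dashboard prompt set. It assumes the density is the fraction of tokens with a nonzero activation and that it was measured on a sample comparable to the 16,384 prompts of 128 tokens listed above. Neither assumption is stated on the page.

```python
# Values copied from the dashboard configuration.
n_prompts = 16_384
tokens_per_prompt = 128
density_percent = 0.005  # "Activations Density 0.005%"

total_tokens = n_prompts * tokens_per_prompt
# Assumption: density is the share of token positions where the feature fires.
expected_active = total_tokens * density_percent / 100

print(f"total tokens: {total_tokens:,}")
print(f"expected active tokens: about {round(expected_active)}")
```

Under these assumptions the feature would fire on roughly a hundred tokens out of about two million, so it is very sparse.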

No Known Activations