INDEX

Explanations

a, an

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

mwhanna/qwen3-4b-transcoders/layer_19.safetensors

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

163,840

Data Type

float32

Hook Name

blocks.19.mlp.hook_in

Architecture

transcoder

Context Size

8,192

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ç¯ĳ

-0.32

errat

-0.29

çļĦæĥħåĨµ

-0.26

buch

-0.26

baugh

-0.25

InChildren

-0.24

å¸¦ä½ł

-0.24

 Perr

-0.24

 cuis

-0.23

Interior

-0.23

POSITIVE LOGITS

åĳ½

0.29

fixtures

0.28

orate

0.27

vention

0.26

Apis

0.25

æ¡¡

0.24

 glide

0.23

æĹłè®ºå¦Ĥä½ķ

0.23

pressions

0.23

aten

0.23

Activations Density 0.358%

a, an

No Comments

No Known Activations