Explanations

past tense verbs

np_max-act · gemini-2.0-flash

Configuration

mwhanna/qwen3-4b-transcoders/layer_19.safetensors

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

163,840

Data Type

float32

Hook Name

blocks.19.mlp.hook_in

Architecture

transcoder

Context Size

8,192

Dataset

monology/pile-uncopyrighted

Negative Logits

常德

-0.27

 Administration

-0.25

背

-0.24

挞

-0.24

others

-0.24

/Admin

-0.24

otto

-0.24

isia

-0.23

 Others

-0.23

怀

-0.23

Positive Logits

stance

0.29

urement

0.28

日报道

0.27

 anonymously

0.26

przed

0.26

深夜

0.25

opot

0.25

前十

0.24

pread

0.24

umer

0.24
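
Logit lists like the two above can show multibyte tokens in a byte-level BPE alphabet rather than as readable text, so that 常德 appears as `å¸¸å¾·`. The following is a minimal sketch of how such strings can be decoded. It assumes the tokenizer uses the GPT-2 style byte-to-character table, which this page does not confirm; `bytes_to_unicode` and `decode_token` are hypothetical helper names, not part of any dashboard API.

```python
def bytes_to_unicode():
    """Build the GPT-2 style table mapping each byte value to one printable character."""
    byte_values = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    code_points = byte_values[:]
    n = 0
    for b in range(256):
        if b not in byte_values:
            # Non-printable bytes are shifted into the range starting at U+0100.
            byte_values.append(b)
            code_points.append(256 + n)
            n += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: displayed character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(raw: str) -> str:
    """Turn a byte-level token string back into readable UTF-8 text."""
    data = bytearray()
    for ch in raw:
        if ch in CHAR_TO_BYTE:
            data.append(CHAR_TO_BYTE[ch])
        else:
            # Characters outside the table (e.g. an already-decoded space) pass through.
            data.extend(ch.encode("utf-8"))
    return data.decode("utf-8", errors="replace")


print(decode_token("å¸¸å¾·"))  # 常德
```

Tokens that are partial UTF-8 sequences decode to the replacement character, which is expected for sub-character token pieces.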

Activations Density

0.025%
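
To give the density figure a sense of scale, the sketch below converts it into an approximate token count. It rests on two assumptions the page does not state: that density is the fraction of tokens on which the feature fires, and that it was measured over the dashboard prompt set (16,384 prompts of 128 tokens each).

```python
# Figures taken from the dashboard configuration.
num_prompts = 16_384
tokens_per_prompt = 128
density = 0.025 / 100  # 0.025% expressed as a fraction

# Assumption: density = active tokens / total tokens in the dashboard prompt set.
total_tokens = num_prompts * tokens_per_prompt
expected_active = total_tokens * density

print(total_tokens)            # 2097152
print(round(expected_active))  # 524
```

Under those assumptions the feature would fire on roughly 500 of about 2.1 million tokens.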
