INDEX

Explanations

"to be" verb forms

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

mwhanna/qwen3-4b-transcoders/layer_9.safetensors

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

163,840

Data Type

float32

Hook Name

blocks.9.mlp.hook_in

Architecture

transcoder

Context Size

8,192

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ãĥŀãĤ¤

-0.28

({↵↵

-0.25

 spit

-0.25

 ratified

-0.25

ç©¿è¡£

-0.24

untime

-0.24

 tabindex

-0.23

conversion

-0.23

ä¸ĬçĻ¾

-0.23

ullet

-0.23

POSITIVE LOGITS

utes

0.27

yps

0.27

edef

0.25

 Ð¿ÑĢÐ°Ð²Ð°

0.25

itionally

0.24

Red

0.24

çİ¯å¢ĥä¸Ń

0.24

æĹ¸

0.24

çİ¯å¢ĥä¸ĭ

0.24

-working

0.24

Activations Density 0.773%

"to be" verb forms

No Comments

No Known Activations