INDEX

Explanations

foreign

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

mwhanna/qwen3-4b-transcoders/layer_9.safetensors

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

163,840

Data Type

float32

Hook Name

blocks.9.mlp.hook_in

Architecture

transcoder

Context Size

8,192

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

foreign

-0.32

Foreign

-0.31

 Foreign

-0.31

 Bret

-0.29

 foreign

-0.29

_foreign

-0.29

ken

-0.26

æ®ĸ

-0.25

 FOREIGN

-0.25

Self

-0.25

POSITIVE LOGITS

éĹ´

0.29

ç»Ńçº¦

0.28

æľīæĽ´å¥½çļĦ

0.28

ç«¯

0.27

olut

0.27

arnation

0.26

equ

0.26

serter

0.26

çļĦæĪ¿åŃĲ

0.26

/=

0.25

Activations Density 0.623%

foreign

No Comments

No Known Activations