Explanations

adverbs describing actions

np_max-act · gemini-2.0-flash

Configuration

mwhanna/qwen3-4b-transcoders/layer_19.safetensors

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

163,840

Data Type

float32

Hook Name

blocks.19.mlp.hook_in

Architecture

transcoder

Context Size

8,192

Dataset

monology/pile-uncopyrighted

Negative Logits

 Genre

-0.27

に戻

-0.26

baum

-0.26

chemy

-0.26

鹘

-0.25

とにかく

-0.25

痘

-0.25

 Tart

-0.24

死去

-0.24

 reverted

-0.23

Positive Logits

edBy

0.28

 delights

0.26

置

0.26

ilm

0.26

antly

0.25

cue

0.25

FullYear

0.25

入学

0.25

 maxim

0.25

oref

0.25

Activations Density 0.018%
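
The density figure can be turned into a rough token count. The sketch below assumes (the page does not state this) that the 0.018% density is measured over the dashboard prompts listed under Configuration, that is, 16,384 prompts of 128 tokens each. The variable names are illustrative only.

```python
# Assumption: "Activations Density" is the fraction of dashboard tokens
# on which this feature has a nonzero activation.
NUM_PROMPTS = 16_384        # from "Prompts (Dashboard)"
TOKENS_PER_PROMPT = 128     # from "Prompts (Dashboard)"
DENSITY_PERCENT = 0.018     # from "Activations Density"

# Total number of token positions in the dashboard prompt set.
total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT

# Approximate number of token positions where the feature fires.
expected_active_tokens = total_tokens * DENSITY_PERCENT / 100

print(f"total tokens: {total_tokens:,}")
print(f"approx. active tokens: {expected_active_tokens:.0f}")
```

Under that assumption the feature would fire on only a few hundred of roughly two million tokens, which is consistent with a sparse feature.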
