INDEX

Explanations

either "like the" or abbreviations/short informal words

oai_token-act-pair · gemini-2.0-flash

media/entertainment

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

google/gemma-scope-2b-pt-transcoders/layer_3/width_16k/average_l0_54

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

16,384

Data Type

float32

Hook Name

blocks.3.ln2.hook_normalized

Architecture

jumprelu_transcoder

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

########.

-0.93

 بيها

-0.79

 مشين

-0.75

AutoScaleMode

-0.69

enumii

-0.67

رشف

-0.66

enumi

-0.65

 gangen

-0.64

NameInMap

-0.63

makeConstraints

-0.63

POSITIVE LOGITS

 like

2.17

like

1.94

Like

1.90

 Like

1.89

 LIKE

1.77

LIKE

1.70

 likes

1.28

 seperti

1.23

 Seperti

1.16

 như

1.15

Activations Density 0.500%

either "like the" or abbreviations/short informal words

media/entertainment

No Comments

No Known Activations