Explanations

words related to difficulty or challenges

oai_token-act-pair · gpt-4o-mini Triggered by @bot

Top Features by Cosine Similarity

Comparing With GEMMA-2-9B @ 0-gemmascope-res-16k

Configuration: google/gemma-scope-9b-pt-res/layer_0/width_16k/average_l0_129

Prompts (Dashboard): 24,576 prompts, 128 tokens each
Dataset (Dashboard): monology/pile-uncopyrighted
Features: 16,384
Data Type: float32
Hook Name: blocks.0.hook_resid_post
Hook Layer: 0
Architecture: jumprelu
Context Size: 1,024
Dataset: monology/pile-uncopyrighted
Activation Function: relu
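
The configuration lists the architecture as `jumprelu` and the activation function as `relu`. As a rough illustration of how the two differ, the sketch below applies both to the same pre-activations. It assumes the standard JumpReLU definition (a value passes through unchanged only if it exceeds a per-feature threshold, otherwise it becomes zero). The function names, inputs, and threshold values are hypothetical and are not taken from this SAE's weights.

```python
from typing import List


def relu(pre_acts: List[float]) -> List[float]:
    """Plain ReLU: negative pre-activations become 0."""
    return [max(0.0, x) for x in pre_acts]


def jumprelu(pre_acts: List[float], thresholds: List[float]) -> List[float]:
    """JumpReLU: keep a pre-activation only if it exceeds its feature's threshold."""
    if len(pre_acts) != len(thresholds):
        raise ValueError("need one threshold per feature")
    return [x if x > t else 0.0 for x, t in zip(pre_acts, thresholds)]


# Hypothetical pre-activations and thresholds for three features.
pre = [-0.5, 0.2, 1.5]
theta = [0.0, 0.5, 0.5]

print(relu(pre))             # [0.0, 0.2, 1.5]
print(jumprelu(pre, theta))  # [0.0, 0.0, 1.5]
```

The second feature shows the difference: ReLU lets the small positive value 0.2 through, while JumpReLU zeroes it because it falls below that feature's threshold of 0.5.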

Negative Logits

<unused68>    -0.94
<unused8>     -0.94
<unused3>     -0.94
[@BOS@]       -0.94
<unused52>    -0.94
<unused79>    -0.94
<unused28>    -0.93
<unused41>    -0.93
<unused14>    -0.93
<pad>         -0.93

Positive Logits

EventHandler  0.50
↵             0.38
<em>          0.36
 Water        0.35
↵↵            0.35
util          0.34
 useState     0.34
"[            0.34
Phi           0.33
Util          0.32

Activations Density 0.272%
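
To give the density figure a sense of scale, the short calculation below converts it into an approximate token count. It assumes, without confirmation from the page, that the density is the fraction of dashboard tokens (24,576 prompts of 128 tokens each) on which the feature is nonzero. The variable names are illustrative.

```python
# Figures copied from the page.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.272

total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT

# Assumption: density = share of these tokens where the feature activates.
expected_active = total_tokens * DENSITY_PERCENT / 100

print(f"total tokens: {total_tokens:,}")
print(f"expected active tokens: about {expected_active:,.0f}")
```

Under that assumption the feature would fire on roughly 8,556 of 3,145,728 tokens.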
