INDEX

Explanations

references to knowledge and understanding

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Top Features by Cosine Similarity

Comparing With GEMMA-2-2B @ 15-gemmascope-res-16k

Configuration

google/gemma-scope-2b-pt-res/layer_15/width_16k/average_l0_78

Prompts (Dashboard)

36,864 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

16,384

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Hook Layer

Architecture

jumprelu

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

WriteTagHelper

-0.67

 Larsson

-0.65

dhist

-0.61

qxd

-0.60

 Borja

-0.60

Tos

-0.59

 Pfalz

-0.59

ábbi

-0.58

 Alain

-0.57

DotNetBar

-0.57

POSITIVE LOGITS

know

1.44

 know

1.42

 Know

1.40

Know

1.39

KNOW

1.35

 knows

1.34

 KNOW

1.30

 Knows

1.22

knows

1.21

knew

1.18

Activations Density 0.132%

references to knowledge and understanding

No Comments

No Known Activations