INDEX

Explanations

mentions of geographical locations and significant historical or cultural entities

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Top Features by Cosine Similarity

Comparing With GEMMA-2-2B @ 5-gemmascope-res-16k

Configuration

google/gemma-scope-2b-pt-res/layer_5/width_16k/average_l0_68

Prompts (Dashboard)

36,864 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

16,384

Data Type

float32

Hook Name

blocks.5.hook_resid_post

Hook Layer

Architecture

jumprelu

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

||}

-0.81

 Dulles

-0.73

 Gogh

-0.72

Datuak

-0.72

migrationBuilder

-0.71

LikeLike

-0.71

keren

-0.69

pyplot

-0.69

__':

-0.69

Nal

-0.69

POSITIVE LOGITS

 Roskov

0.81

 deberes

0.79

mogat

0.78

 esclavos

0.78

 Lizzy

0.74

 étoient

0.74

ContentAlignment

0.73

 Elin

0.73

 bilinear

0.72

 brancas

0.71

Activations Density 2.712%

mentions of geographical locations and significant historical or cultural entities

No Comments

No Known Activations