INDEX

Explanations

symbols or formatting elements

oai_token-act-pair · gpt-3.5-turbo

instances of symbols or representations of political or social movements

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Top Features by Cosine Similarity

Comparing With GPT2-SMALL @ 10-res-jb

Configuration

jbloom/GPT2-Small-SAEs-Reformatted/blocks.10.hook_resid_pre

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

Skylion007/openwebtext

Features

24,576

Data Type

torch.float32

Hook Point

blocks.10.hook_resid_pre

Architecture

standard

Context Size

128

Dataset

Skylion007/openwebtext

Hook Point Layer

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Manz

-0.64

dod

-0.64

 Jeanne

-0.64

 Izan

-0.64

 Bris

-0.62

 shroud

-0.61

sters

-0.61

 Shelter

-0.61

ABE

-0.61

 sacrific

-0.61

POSITIVE LOGITS

ª

1.45

Ĵ

1.39

Ĳ

1.33

«

1.24

¹

1.20

ı

1.20

ĸ

1.18

³

1.17

ĳ

1.17

Ķ

1.17

Activations Density 0.090%

symbols or formatting elements

instances of symbols or representations of political or social movements

No Comments

No Known Activations