INDEX

Explanations

personal statements where the speaker expresses their thoughts or feelings about something

oai_token-act-pair · gpt-3.5-turbo

expressions of self-identity and the speaker's personal experiences

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Top Features by Cosine Similarity

Comparing With GPT2-SMALL @ 0-res-jb

Configuration

jbloom/GPT2-Small-SAEs-Reformatted/blocks.0.hook_resid_pre

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

Skylion007/openwebtext

Features

24,576

Data Type

torch.float32

Hook Point

blocks.0.hook_resid_pre

Architecture

standard

Context Size

128

Dataset

Skylion007/openwebtext

Hook Point Layer

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Kut

-0.70

boards

-0.67

iquette

-0.66

 Rolls

-0.62

 Razor

-0.62

theless

-0.61

Snake

-0.60

birds

-0.60

 Shel

-0.59

bones

-0.58

POSITIVE LOGITS

am

3.38

Am

1.74

Am

1.67

pm

1.59

AM

1.57

'm

1.56

am

1.39

AM

1.19

amic

0.98

im

0.95

Activations Density 0.033%

personal statements where the speaker expresses their thoughts or feelings about something

expressions of self-identity and the speaker's personal experiences

No Comments

No Known Activations

personal statements where the speaker expresses their thoughts or feelings about something

expressions of self-identity and the speaker's personal experiences

No Comments

No Known Activations