INDEX

Explanations

proper nouns and locations

oai_token-act-pair · gpt-3.5-turbo Triggered by @bot

New Auto-Interp

Top Features by Cosine Similarity

Comparing With GPT2-SMALL @ 8-res_fs1536-jb

Configuration

jbloom/GPT2-Small-Feature-Splitting-Experiment-Layer-8/blocks.8.hook_resid_pre_1536

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

Skylion007/openwebtext

Features

1,536

Data Type

torch.float32

Hook Point

blocks.8.hook_resid_pre

Architecture

standard

Context Size

128

Dataset

Skylion007/openwebtext

Hook Point Layer

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 steep

-0.62

 scarcity

-0.60

 worn

-0.58

 allowances

-0.57

 heav

-0.57

 commod

-0.57

 mainline

-0.57

 discriminating

-0.56

 scra

-0.56

 brace

-0.55

POSITIVE LOGITS

icz

1.19

ovsky

1.01

akis

0.97

ois

0.96

ean

0.96

ansky

0.95

yk

0.95

ove

0.94

anski

0.94

aja

0.94

Activations Density 1.971%

proper nouns and locations

No Comments

No Known Activations

proper nouns and locations

No Comments

No Known Activations