INDEX

Explanations

mathematical equations or formal notations

oai_token-act-pair · gpt-3.5-turbo

mathematical equations or expressions

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Top Features by Cosine Similarity

Comparing With GPT2-SMALL @ 8-res-jb

Configuration

jbloom/GPT2-Small-SAEs-Reformatted/blocks.8.hook_resid_pre

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

Skylion007/openwebtext

Features

24,576

Data Type

torch.float32

Hook Point

blocks.8.hook_resid_pre

Architecture

standard

Context Size

128

Dataset

Skylion007/openwebtext

Hook Point Layer

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 livest

-0.85

eness

-0.82

itage

-0.82

 nodd

-0.78

igating

-0.76

wright

-0.73

SPONSORED

-0.71

esis

-0.70

imony

-0.67

ovan

-0.67

POSITIVE LOGITS

========

1.62

============

1.50

===

1.09

ãĥīãĥ©ãĤ´ãĥ³

0.83

 TRUE

0.78

ãĥ´ãĤ¡

0.74

ãĤ¨ãĥ«

0.72

 False

0.72

 FALSE

0.71

 infinity

0.68

Activations Density 0.021%

mathematical equations or formal notations

mathematical equations or expressions

No Comments

No Known Activations

mathematical equations or formal notations

mathematical equations or expressions

No Comments

No Known Activations