Explanations

No Explanations Found

Comparing With GPT2-SMALL @ 2-res-jb

Configuration

jbloom/GPT2-Small-SAEs-Reformatted/blocks.2.hook_resid_pre

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

Skylion007/openwebtext

Features

24,576

Data Type

torch.float32

Hook Point

blocks.2.hook_resid_pre

Architecture

standard

Context Size

128

Dataset

Skylion007/openwebtext

Hook Point Layer

Activation Function

relu
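
The configuration above names a "standard" architecture with a relu activation. As a rough illustration of what that usually means for a sparse autoencoder, the sketch below encodes one residual-stream vector into feature activations. It is a toy under stated assumptions, not the code that produced this dashboard: the function name `sae_encode`, the weight layout, and the choice to subtract the decoder bias before encoding are all hypothetical, and the dimensions are tiny stand-ins for the real 24,576-feature dictionary.

```python
from typing import List


def relu(x: float) -> float:
    """Rectified linear unit: negative pre-activations become zero."""
    return x if x > 0.0 else 0.0


def sae_encode(
    resid: List[float],
    w_enc: List[List[float]],  # assumed layout: d_model rows, n_features columns
    b_enc: List[float],        # one encoder bias per feature
    b_dec: List[float],        # decoder bias, subtracted from the input (assumption)
) -> List[float]:
    """Return one non-negative activation per feature for a single token."""
    centered = [r - b for r, b in zip(resid, b_dec)]
    acts = []
    for j in range(len(b_enc)):
        pre = sum(centered[i] * w_enc[i][j] for i in range(len(centered))) + b_enc[j]
        acts.append(relu(pre))
    return acts


# Toy example: a 2-dimensional "residual stream" and 3 features.
toy_resid = [1.0, -2.0]
toy_w_enc = [
    [1.0, -1.0, 0.5],
    [0.0, 1.0, 0.5],
]
toy_b_enc = [0.0, 0.0, -1.0]
toy_b_dec = [0.0, 0.0]
toy_acts = sae_encode(toy_resid, toy_w_enc, toy_b_enc, toy_b_dec)
```

In this toy only the first feature has a positive pre-activation, so the relu zeroes the other two. A feature whose pre-activation is never positive on any sampled token would show no activations at all.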

Negative Logits

 frequ

-0.69

 burst

-0.68

 sexually

-0.63

avg

-0.63

fully

-0.63

 tremend

-0.62

fur

-0.61

 smoker

-0.61

 cruise

-0.61

 auction

-0.60

Positive Logits

ãĥīãĥ©

0.73

umbs

0.71

itches

0.67

 Gate

0.67

okemon

0.66

ramids

0.65

 Chronicles

0.65

RECT

0.65

osure

0.64

OUN

0.64
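
GPT-2 stores its vocabulary in a byte-to-unicode alphabet, which is why the UTF-8 bytes of a non-ASCII token appear as the string `ãĥīãĥ©` rather than as readable text. The sketch below rebuilds that byte mapping (following the scheme used by the GPT-2 byte-level BPE tokenizer) and inverts it. The helper name `decode_vocab_string` is hypothetical. Under this mapping the string above decodes to the katakana ドラ.

```python
def bytes_to_unicode() -> dict:
    """Map each byte value 0..255 to a printable unicode character.

    Printable bytes map to themselves; the remaining bytes are shifted
    to code points starting at 256, in increasing byte order.
    """
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, [chr(c) for c in cs]))


# Inverse table: vocabulary character -> original byte value.
BYTE_DECODER = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_vocab_string(token: str) -> str:
    """Turn a raw vocabulary string back into readable text."""
    raw = bytes(BYTE_DECODER[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


decoded = decode_vocab_string("ãĥīãĥ©")
```

The same mapping explains the `Ġ` prefix seen in raw GPT-2 vocabularies: it is the stand-in for a leading space byte.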

Activations Density 0.000%

No Known Activations

This feature has no known activations.
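
An activation density of 0.000% alongside "no known activations" is what one would expect if the feature never fired on the sampled prompts. The sketch below assumes density means the fraction of sampled tokens with activation above zero. That definition, the function name `activation_density`, and the toy inputs are assumptions for illustration, not the dashboard's documented method.

```python
from typing import List

# Sample size stated in the configuration: 24,576 prompts of 128 tokens each.
TOTAL_TOKENS = 24_576 * 128


def activation_density(activations: List[List[float]], threshold: float = 0.0) -> float:
    """Fraction of tokens whose activation exceeds `threshold`.

    `activations` holds one list of per-token activations per prompt.
    """
    total = 0
    fired = 0
    for prompt in activations:
        for value in prompt:
            total += 1
            if value > threshold:
                fired += 1
    return fired / total if total else 0.0


# A feature that never fires on any token has density exactly zero.
dead_feature = [[0.0] * 128 for _ in range(4)]
dead_density = activation_density(dead_feature)

# A feature that fires on one of four tokens has density 0.25.
sparse_density = activation_density([[0.0, 1.5], [0.0, 0.0]])
```

With a sample of this size, a displayed 0.000% could also come from a very small nonzero count being rounded down, so the "no known activations" line is the stronger signal that the feature is inactive on this data.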