INDEX

Explanations

references to organizations, measurements, and scientific qualifications

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Configuration

neuronpedia/gpt2-small__res_slefr-ajt/2-res_slefr-ajt

Prompts (Dashboard)

12,288 prompts, 128 tokens each

Dataset (Dashboard)

Skylion007/openwebtext

Features

46,080

Data Type

torch.float32

Hook Point

blocks.2.hook_resid_pre

Architecture

standard

Context Size

128

Dataset

apollo-research/Skylion007-openwebtext-tokenizer-gpt2

Hook Point Layer

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

mates

-0.73

 Hayden

-0.70

mallow

-0.69

bell

-0.69

Sne

-0.68

 Spiegel

-0.66

flies

-0.64

tem

-0.64

point

-0.63

chester

-0.63

POSITIVE LOGITS

otiation

0.77

ayers

0.76

INS

0.76

ource

0.74

allery

0.72

ointment

0.71

hari

0.71

inx

0.71

oing

0.69

isodes

0.69

Activations Density 0.109%

references to organizations, measurements, and scientific qualifications

No Comments

No Known Activations