INDEX

Explanations

Pokémon names and related information

oai_token-act-pair · gpt-3.5-turbo

New Auto-Interp

Configuration

neuronpedia/gpt2-small__res_scefr-ajt/6-res_scefr-ajt

Prompts (Dashboard)

12,288 prompts, 128 tokens each

Dataset (Dashboard)

Skylion007/openwebtext

Features

46,080

Data Type

torch.float32

Hook Point

blocks.6.hook_resid_pre

Architecture

standard

Context Size

128

Dataset

apollo-research/Skylion007-openwebtext-tokenizer-gpt2

Hook Point Layer

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Penal

-0.36

 rehe

-0.36

 saints

-0.34

assed

-0.34

 fasting

-0.32

 booted

-0.32

 sins

-0.31

 subscript

-0.31

 Saints

-0.30

IRA

-0.30

POSITIVE LOGITS

sembly

0.41

atform

0.38

erie

0.35

DF

0.35

science

0.34

ãĤ´ãĥ³

0.33

xual

0.33

intend

0.33

date

0.32

 Plants

0.32

Activations Density 0.011%

Pokémon names and related information

No Comments

No Known Activations

Pokémon names and related information

No Comments

No Known Activations