INDEX

Explanations

a mix of different characters, possibly non-English characters or symbols

oai_token-act-pair · gpt-3.5-turbo

New Auto-Interp

Configuration

neuronpedia/gpt2-small__res_scl-ajt/6-res_scl-ajt

Prompts (Dashboard)

12,288 prompts, 128 tokens each

Dataset (Dashboard)

Skylion007/openwebtext

Features

46,080

Data Type

torch.float32

Hook Point

blocks.6.hook_resid_pre

Architecture

standard

Context Size

128

Dataset

apollo-research/Skylion007-openwebtext-tokenizer-gpt2

Hook Point Layer

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Dragonbound

-0.98

oulos

-0.87

aneers

-0.79

espie

-0.77

 proximity

-0.75

itaire

-0.74

ACTED

-0.73

ettings

-0.73

isites

-0.71

theless

-0.71

POSITIVE LOGITS

´

1.78

¤

1.76

Ķ

1.71

ľ

1.70

Į

1.70

¹

1.69

¸

1.69

Ħ

1.68

ĥ

1.68

°

1.68

Activations Density 6.163%

a mix of different characters, possibly non-English characters or symbols

No Comments

No Known Activations

a mix of different characters, possibly non-English characters or symbols

No Comments

No Known Activations