INDEX

Explanations

proper nouns like names of people and places

oai_token-act-pair · gpt-3.5-turbo

New Auto-Interp

Configuration

neuronpedia/gpt2-small__res_scl-ajt/6-res_scl-ajt

Prompts (Dashboard)

12,288 prompts, 128 tokens each

Dataset (Dashboard)

Skylion007/openwebtext

Features

46,080

Data Type

torch.float32

Hook Point

blocks.6.hook_resid_pre

Architecture

standard

Context Size

128

Dataset

apollo-research/Skylion007-openwebtext-tokenizer-gpt2

Hook Point Layer

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 guiName

-0.84

();

-0.79

Dialogue

-0.77

[];

-0.75

().

-0.72

+.

-0.65

{{

-0.64

 ende

-0.63

."

-0.62

('

-0.62

POSITIVE LOGITS

),"

1.54

)'

1.51

)"

1.43

)."

1.42

1.41

)",

1.35

)=

1.26

?)

1.25

)[

1.25

)/

1.22

Activations Density 9.413%

proper nouns like names of people and places

No Comments

No Known Activations

proper nouns like names of people and places

No Comments

No Known Activations