INDEX

Explanations

references to historical events, political figures, and international relations

oai_token-act-pair · gpt-3.5-turbo

New Auto-Interp

Configuration

neuronpedia/gpt2-small__res_scefr-ajt/6-res_scefr-ajt

Prompts (Dashboard)

12,288 prompts, 128 tokens each

Dataset (Dashboard)

Skylion007/openwebtext

Features

46,080

Data Type

torch.float32

Hook Point

blocks.6.hook_resid_pre

Architecture

standard

Context Size

128

Dataset

apollo-research/Skylion007-openwebtext-tokenizer-gpt2

Hook Point Layer

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

icter

-0.97

oscope

-0.97

emet

-0.91

effic

-0.87

İĭ

-0.85

udeb

-0.83

BILITIES

-0.82

orks

-0.82

GGGGGGGG

-0.81

aida

-0.80

POSITIVE LOGITS

tenance

1.07

 alike

1.02

 hybrid

0.97

 complexes

0.96

 hybrids

0.96

byter

0.95

osterone

0.95

 Relations

0.88

âĦ¢:

0.87

ption

0.86

Activations Density 2.949%

references to historical events, political figures, and international relations

No Comments

No Known Activations

references to historical events, political figures, and international relations

No Comments

No Known Activations