INDEX

Explanations

exact or precise instances or descriptions

oai_token-act-pair · gpt-3.5-turbo

Configuration

neuronpedia/gpt2-small__res_scefr-ajt/6-res_scefr-ajt

Prompts (Dashboard)

12,288 prompts, 128 tokens each

Dataset (Dashboard)

Skylion007/openwebtext

Features

46,080

Data Type

torch.float32

Hook Point

blocks.6.hook_resid_pre

Architecture

standard

Context Size

128

Dataset

apollo-research/Skylion007-openwebtext-tokenizer-gpt2

Hook Point Layer

6

Activation Function

relu
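
As a rough illustration of how these settings fit together, the sketch below records them in a small dataclass and runs a toy forward pass for a ReLU sparse autoencoder. It assumes that "standard" refers to the common form ReLU(W_enc (x - b_dec) + b_enc) with a linear decoder, and that the hooked residual stream of GPT-2 small has width 768. The names SaeConfig, encode and decode, the toy sizes, and the random weights are hypothetical and are not taken from the SAE itself.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class SaeConfig:
    """Values copied from the configuration listing, plus one assumption."""
    hook_point: str = "blocks.6.hook_resid_pre"
    n_features: int = 46_080
    context_size: int = 128
    dtype: str = "torch.float32"
    activation: str = "relu"
    d_model: int = 768  # assumption: residual stream width of GPT-2 small

    @property
    def expansion_factor(self) -> float:
        # Dictionary size relative to the width of the hooked activation.
        return self.n_features / self.d_model


def encode(x, w_enc, b_enc, b_dec):
    """Assumed 'standard' encoder: ReLU(W_enc (x - b_dec) + b_enc)."""
    centered = [xi - bi for xi, bi in zip(x, b_dec)]
    return [
        max(0.0, sum(w * c for w, c in zip(row, centered)) + b)
        for row, b in zip(w_enc, b_enc)
    ]


def decode(features, w_dec, b_dec):
    """Reconstruct the input as b_dec plus a weighted sum of decoder rows."""
    out = list(b_dec)
    for f, row in zip(features, w_dec):
        if f > 0.0:  # inactive features contribute nothing
            for j, w in enumerate(row):
                out[j] += f * w
    return out


# Toy sizes so the example runs instantly; the listed SAE is 768 -> 46,080.
rng = random.Random(0)
d_model, n_features = 4, 8
w_enc = [[rng.gauss(0, 1) for _ in range(d_model)] for _ in range(n_features)]
w_dec = [[rng.gauss(0, 1) for _ in range(d_model)] for _ in range(n_features)]
b_enc = [0.0] * n_features
b_dec = [0.0] * d_model

x = [rng.gauss(0, 1) for _ in range(d_model)]
features = encode(x, w_enc, b_enc, b_dec)
reconstruction = decode(features, w_dec, b_dec)

print(SaeConfig().expansion_factor)  # 60.0
```

Under the 768-width assumption, 46,080 features corresponds to a dictionary 60 times wider than the activation it reads from.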

Negative Logits

ker

-1.25

asta

-1.09

olyn

-1.06

kers

-1.03

isson

-1.00

roe

-0.99

 Tome

-0.98

cffff

-0.97

 Mushroom

-0.97

oly

-0.97
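
Token lists like the one above are commonly produced by projecting a feature's decoder direction through the model's unembedding matrix and sorting the resulting per-token values. The sketch below shows that projection and ranking on a toy vocabulary. It assumes this is how the dashboard derives its logit lists; the function names and all numbers in the example are hypothetical.

```python
def logit_effects(decoder_dir, unembed):
    """Project a decoder direction (d_model,) through unembed (d_model x vocab)."""
    vocab_size = len(unembed[0])
    return [
        sum(decoder_dir[i] * unembed[i][t] for i in range(len(decoder_dir)))
        for t in range(vocab_size)
    ]


def top_and_bottom(effects, vocab, k):
    """Return the k most negative and k most positive (token, value) pairs."""
    ranked = sorted(zip(vocab, effects), key=lambda pair: pair[1])
    negative = ranked[:k]          # most suppressed tokens first
    positive = ranked[::-1][:k]    # most promoted tokens first
    return negative, positive


# Toy example: 2-dimensional model, 4-token vocabulary.
vocab = ["a", "b", "c", "d"]
decoder_dir = [1.0, -0.5]
unembed = [
    [-1.0, 1.0, 0.5, -0.5],
    [0.5, -0.1, -1.0, 1.0],
]

effects = logit_effects(decoder_dir, unembed)
negative, positive = top_and_bottom(effects, vocab, k=2)
print([tok for tok, _ in negative])
print([tok for tok, _ in positive])
```

Read this way, a negative value means the feature pushes the model away from predicting that token, and a positive value means it pushes toward it.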

Positive Logits

 aligned

1.05

ãĤ¨

1.04

 wrong

1.01

 calibrated

0.98

 suited

0.96

 tuned

0.95

 opposite

0.94

 matched

0.93

 align

0.93

 positioned

0.92
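
The entry "ãĤ¨" in the list above looks garbled because GPT-2's tokenizer stores each token as a string of stand-in characters, one per UTF-8 byte. The sketch below rebuilds that byte-to-character table, following the mapping used in GPT-2's published encoder, and inverts it to recover readable text. The helper name decode_token is hypothetical.

```python
def bytes_to_unicode():
    """Byte -> printable stand-in character, as in GPT-2's byte-level BPE."""
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    for b in range(256):
        if b not in bs:
            # Unprintable bytes are shifted to code points 256 and above.
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, [chr(c) for c in cs]))


CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token):
    """Turn a raw vocabulary string back into the text it encodes."""
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


print(decode_token("ãĤ¨"))             # エ
print(repr(decode_token("Ġaligned")))  # ' aligned'
```

The three stand-in characters map to the bytes E3 82 A8, which is the UTF-8 encoding of the katakana character エ. The same table explains the leading spaces on tokens such as " aligned", which the raw vocabulary writes with a "Ġ" prefix.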

Activations Density 0.432%
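
Activation density is usually read as the fraction of sampled tokens on which the feature fires at all. The sketch below computes that fraction for a toy list of activations. It then estimates how many tokens 0.432% would correspond to, assuming the figure was measured over the 12,288 prompts of 128 tokens listed in the configuration. That assumption, the function name, and the toy values are illustrative and are not stated by the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds the threshold."""
    if not activations:
        raise ValueError("need at least one activation")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy example: the feature fires on 2 of 8 tokens.
toy_activations = [0.0, 0.0, 1.7, 0.0, 0.3, 0.0, 0.0, 0.0]
toy_density = activation_density(toy_activations)
print(toy_density)  # 0.25

# Assumption: density was measured over the dashboard prompts.
total_tokens = 12_288 * 128
estimated_active = round(total_tokens * 0.432 / 100)
print(total_tokens, estimated_active)  # 1572864 6795
```

On that reading, the feature would be active on roughly 6,800 of about 1.57 million tokens.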

No Known Activations