Explanations

mentions of names of cities or locations

oai_token-act-pair · gpt-3.5-turbo

Configuration

neuronpedia/gpt2-small__res_scefr-ajt/6-res_scefr-ajt

Prompts (Dashboard)

12,288 prompts, 128 tokens each

Dataset (Dashboard)

Skylion007/openwebtext

Features

46,080

Data Type

torch.float32

Hook Point

blocks.6.hook_resid_pre

Architecture

standard

Context Size

128

Dataset

apollo-research/Skylion007-openwebtext-tokenizer-gpt2

Hook Point Layer

Activation Function

relu
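
As a rough illustration of what the configuration above describes, the following is a minimal sketch of how a sparse autoencoder with a ReLU activation might turn one residual-stream vector into feature activations. It assumes that the "standard" architecture means `relu(W_enc @ (x - b_dec) + b_enc)`; some implementations skip the `b_dec` subtraction. The function name, parameter names, and toy dimensions are hypothetical and chosen only for readability. The listed SAE has 46,080 features.

```python
def encode(x, w_enc, b_enc, b_dec):
    """Toy SAE encoder: one activation per feature for a single input vector.

    Assumption: activations = relu(W_enc @ (x - b_dec) + b_enc).
    w_enc holds one row of weights per feature.
    """
    centered = [xi - bi for xi, bi in zip(x, b_dec)]
    pre_acts = [
        sum(w * c for w, c in zip(row, centered)) + b
        for row, b in zip(w_enc, b_enc)
    ]
    # ReLU keeps positive pre-activations and zeroes out the rest.
    return [max(0.0, p) for p in pre_acts]


# Made-up toy values: a 3-dimensional input and 2 features.
x = [1.0, 2.0, 3.0]
b_dec = [0.0, 1.0, 1.0]
w_enc = [[1.0, 0.0, 0.0], [0.0, -1.0, 0.0]]
b_enc = [0.5, 0.0]

acts = encode(x, w_enc, b_enc, b_dec)
print(acts)  # [1.5, 0.0]
```

In this toy case the second feature has a negative pre-activation, so ReLU sets it to zero. That clipping is what makes most features inactive on most tokens.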

Negative Logits

Token            Logit
"vertisements"   -0.85
"gins"           -0.81
"row"            -0.76
"Reviewer"       -0.76
"orters"         -0.75
"istration"      -0.74
"thood"          -0.74
"road"           -0.73
"intern"         -0.73
"angers"         -0.73

Positive Logits

Token        Logit
"£ı"         0.88
" Leban"     0.85
"ION"        0.85
" EDITION"   0.85
"IDER"       0.83
" Lumpur"    0.83
"THR"        0.83
"ABLE"       0.81
"PRESS"      0.81
"BRA"        0.81
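
The page does not say how these logit values are produced. One common approach, assumed here, is to project the feature's decoder direction through the model's unembedding matrix and then list the most negative and most positive tokens. The sketch below shows that idea with invented numbers and placeholder token names; `logit_effects` and `top_and_bottom` are hypothetical helpers, not part of any library.

```python
def logit_effects(decoder_dir, unembed_by_token):
    """Dot the feature's decoder direction with each token's unembedding vector."""
    return {
        tok: sum(d * u for d, u in zip(decoder_dir, vec))
        for tok, vec in unembed_by_token.items()
    }


def top_and_bottom(effects, k):
    """Return the k most negative and the k most positive (token, value) pairs."""
    ranked = sorted(effects.items(), key=lambda kv: kv[1])
    return ranked[:k], ranked[::-1][:k]


# Invented 2-dimensional example; real vectors would have the model's width.
decoder_dir = [1.0, -1.0]
unembed_by_token = {
    "A": [0.5, -0.25],
    "B": [-0.5, 0.25],
    "C": [0.0, 0.0],
}

effects = logit_effects(decoder_dir, unembed_by_token)
negative, positive = top_and_bottom(effects, k=1)
print(negative)  # [('B', -0.75)]
print(positive)  # [('A', 0.75)]
```

Under this assumption, a positive value means the feature pushes the model toward predicting that token when it fires, and a negative value means it pushes away from it.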

Activations Density 0.430%
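
To put the density figure in context, here is a small worked calculation. It assumes the density is the fraction of tokens on which the feature fires, and that it was measured over the dashboard prompt set listed above (12,288 prompts of 128 tokens each). The page does not confirm either assumption, so the result is only an estimate.

```python
prompts = 12_288          # from "Prompts (Dashboard)"
tokens_per_prompt = 128   # from "Context Size"
density = 0.430 / 100     # "Activations Density 0.430%" as a fraction

total_tokens = prompts * tokens_per_prompt
# Assumption: density = active tokens / total tokens in the dashboard set.
active_tokens = density * total_tokens

print(total_tokens)          # 1572864
print(round(active_tokens))  # 6763
```

Under these assumptions the feature would be active on roughly 6,800 of about 1.57 million tokens.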

No Known Activations