Explanations

sports-related terms and actions, especially related to American football

oai_token-act-pair · gpt-3.5-turbo

Configuration

neuronpedia/gpt2-small__res_scl-ajt/6-res_scl-ajt

Prompts (Dashboard)

12,288 prompts, 128 tokens each

Dataset (Dashboard)

Skylion007/openwebtext

Features

46,080

Data Type

torch.float32

Hook Point

blocks.6.hook_resid_pre

Architecture

standard

Context Size

128

Dataset

apollo-research/Skylion007-openwebtext-tokenizer-gpt2

Hook Point Layer

6

Activation Function

relu
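
The "standard" architecture with a ReLU activation is commonly written as f = ReLU(W_enc (x - b_dec) + b_enc). The page does not spell out the formula, so the sketch below is an assumption about that convention, not the exact code behind this SAE. The names `sae_encode`, `W_enc`, `b_enc`, and `b_dec` are illustrative, and the toy sizes (3 residual dimensions, 4 features) stand in for the real 46,080 features.

```python
def relu(values):
    """Element-wise ReLU: negative pre-activations become 0."""
    return [max(0.0, v) for v in values]


def sae_encode(x, W_enc, b_enc, b_dec):
    """Encode one residual-stream vector into feature activations.

    Assumed convention: f = ReLU(W_enc @ (x - b_dec) + b_enc),
    with W_enc stored as one row per feature.
    """
    centered = [xi - bi for xi, bi in zip(x, b_dec)]
    pre_acts = [
        sum(c * w for c, w in zip(centered, row)) + bias
        for row, bias in zip(W_enc, b_enc)
    ]
    return relu(pre_acts)


# Toy example: 3-dimensional input, 4 features (hypothetical weights).
x = [1.0, -2.0, 0.5]
b_dec = [0.0, 0.0, 0.0]
W_enc = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
    [-1.0, -1.0, 0.0],
]
b_enc = [0.0, 0.0, 0.0, 0.0]

acts = sae_encode(x, W_enc, b_enc, b_dec)
print(acts)
```

Only features whose pre-activation is positive stay active after the ReLU.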

Negative Logits

 unborn

-0.97

 metic

-0.86

 interns

-0.84

 clitor

-0.84

 clones

-0.83

 accur

-0.81

 flowering

-0.78

 cloning

-0.78

 purported

-0.77

 oppressed

-0.77
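
Logit lists like these are typically obtained by projecting a feature's decoder direction through the model's unembedding matrix and ranking the vocabulary by the result. The page does not state how its values were computed, so the sketch below assumes that method. The function names, the toy matrix `W_U`, and the three-token vocabulary are all hypothetical.

```python
def logit_effects(decoder_dir, W_U):
    """Project a feature's decoder direction onto every vocabulary token.

    decoder_dir: list of d_model floats.
    W_U: d_model rows, each holding one float per vocabulary token.
    """
    vocab_size = len(W_U[0])
    return [
        sum(decoder_dir[i] * W_U[i][t] for i in range(len(decoder_dir)))
        for t in range(vocab_size)
    ]


def top_and_bottom(effects, vocab, k):
    """Return the k most positive and k most negative (token, value) pairs."""
    ranked = sorted(zip(vocab, effects), key=lambda pair: pair[1])
    positive = ranked[-k:][::-1]  # largest values first
    negative = ranked[:k]         # most negative first
    return positive, negative


# Toy example: d_model = 2, vocabulary of 3 tokens (hypothetical values).
decoder_dir = [1.0, -1.0]
W_U = [
    [0.5, -0.25, 1.0],
    [0.25, 0.75, -0.5],
]
vocab = ["A", "B", "C"]

effects = logit_effects(decoder_dir, W_U)
positive, negative = top_and_bottom(effects, vocab, k=1)
print(positive, negative)
```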

Positive Logits

ï¸ı (byte-level encoding of U+FE0F, variation selector-16)

1.34

----------------------------------------------------------------

1.12

Ther

1.06

conom

1.05

Certainly

1.03

Likewise

1.03

Because

1.01

Otherwise

1.01

ACP

1.01

âĶĢâĶĢ (byte-level encoding of two U+2500 box-drawing characters)

1.01
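
Tokens such as `ï¸ı` and `âĶĢâĶĢ` are not corrupted text. GPT-2's byte-level BPE vocabulary stores every byte as a printable Unicode character, so multi-byte UTF-8 sequences show up as these odd strings. The sketch below rebuilds that byte-to-character table and inverts it to recover the underlying text. The helper name `decode_token` is illustrative, and the token strings are written as escapes so the example does not depend on file encoding.

```python
def bytes_to_unicode():
    """Rebuild GPT-2's mapping from byte values to printable characters."""
    # Bytes that are already printable map to themselves.
    byte_values = list(range(33, 127)) + list(range(161, 173)) + list(range(174, 256))
    code_points = byte_values[:]
    shift = 0
    # Remaining bytes are assigned code points starting at 256.
    for b in range(256):
        if b not in byte_values:
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token):
    """Turn a vocabulary string back into the text it represents."""
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


box_token = "\u00e2\u0136\u0122\u00e2\u0136\u0122"  # displayed as âĶĢâĶĢ
selector_token = "\u00ef\u00b8\u0131"               # displayed as ï¸ı

print(repr(decode_token(box_token)))       # two U+2500 characters
print(repr(decode_token(selector_token)))  # U+FE0F
```

The same table explains why a leading space in a token is stored as `Ġ` in the raw vocabulary.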

Activations Density 7.251%
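
As a rough reading of this figure, the sketch below assumes that density means the fraction of token positions where the feature's activation is greater than zero, and that it was measured over the dashboard prompts (12,288 prompts of 128 tokens each). Neither assumption is confirmed by the page. `activation_density` is an illustrative helper.

```python
PROMPTS = 12_288
TOKENS_PER_PROMPT = 128
TOTAL_TOKENS = PROMPTS * TOKENS_PER_PROMPT  # 1,572,864 token positions


def activation_density(acts):
    """Fraction of token positions where the feature activation is > 0."""
    return sum(1 for a in acts if a > 0) / len(acts)


# Toy example: the feature fires on 1 of 4 positions.
toy_acts = [0.0, 0.5, 0.0, 0.0]
toy_density = activation_density(toy_acts)

# Under the assumptions above, 7.251% of the dashboard tokens would be
# on the order of 114,000 active positions.
approx_active = round(0.07251 * TOTAL_TOKENS)
print(toy_density, approx_active)
```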
