INDEX

Explanations

terms related to linguistic modal verbs and accompanying grammatical properties

oai_token-act-pair · gpt-3.5-turbo

Configuration

neuronpedia/gpt2-small__res_scl-ajt/6-res_scl-ajt

Prompts (Dashboard): 12,288 prompts, 128 tokens each
Dataset (Dashboard): Skylion007/openwebtext
Features: 46,080
Data Type: torch.float32
Hook Point: blocks.6.hook_resid_pre
Architecture: standard
Context Size: 128
Dataset: apollo-research/Skylion007-openwebtext-tokenizer-gpt2
Hook Point Layer: 6
Activation Function: relu
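The configuration fields above read like those of a sparse autoencoder (SAE) trained on the GPT-2 small residual stream; the page itself does not say so, so treat that as an assumption. Under it, a "standard" architecture with a ReLU activation is usually taken to mean an affine encoder followed by ReLU and an affine decoder. The sketch below illustrates that generic form with toy dimensions in plain Python. The function names, the toy weights, and the exact bias handling are hypothetical and are not taken from the trained model.

```python
import random

D_MODEL = 768                  # residual stream width of GPT-2 small
D_SAE = 46_080                 # "Features" value from the configuration
EXPANSION = D_SAE // D_MODEL   # ratio of feature count to residual width


def relu(values):
    """Elementwise ReLU, the activation function named in the configuration."""
    return [v if v > 0.0 else 0.0 for v in values]


def encode(x, w_enc, b_enc):
    """Map a residual vector to feature activations (one encoder row per feature)."""
    pre = [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(w_enc, b_enc)]
    return relu(pre)


def decode(acts, w_dec, b_dec):
    """Reconstruct the residual vector from feature activations (one decoder row per feature)."""
    out = list(b_dec)
    for act, row in zip(acts, w_dec):
        if act != 0.0:  # inactive features contribute nothing
            for j, w in enumerate(row):
                out[j] += act * w
    return out


# Toy sizes so the example runs instantly; the real sizes are D_MODEL and D_SAE.
rng = random.Random(0)
toy_d_model, toy_d_sae = 4, 8
w_enc = [[rng.gauss(0, 1) for _ in range(toy_d_model)] for _ in range(toy_d_sae)]
w_dec = [[rng.gauss(0, 1) for _ in range(toy_d_model)] for _ in range(toy_d_sae)]
b_enc = [0.0] * toy_d_sae
b_dec = [0.0] * toy_d_model

x = [rng.gauss(0, 1) for _ in range(toy_d_model)]  # stand-in for one residual vector
feature_acts = encode(x, w_enc, b_enc)
reconstruction = decode(feature_acts, w_dec, b_dec)
```

With the real sizes, the encoder would map a 768-dimensional vector read at blocks.6.hook_resid_pre to 46,080 non-negative activations, one of which is the feature shown on this page.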

Negative Logits

"ï¸": -0.92
" Lauder": -0.89
"ģĸ": -0.87
" Beir": -0.81
" Haram": -0.77
" Bulldogs": -0.77
" Unic": -0.77
"¨": -0.76
" Quarterly": -0.75
" Baptist": -0.75

Positive Logits

"icum": 1.53
"ifiable": 1.49
"ding": 1.47
"elled": 1.41
"ifiers": 1.39
"ifications": 1.38
"ded": 1.37
"ules": 1.36
"ifies": 1.34
"ulo": 1.29
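Lists like the negative and positive logits above are commonly produced by projecting a feature's decoder direction through the model's unembedding matrix and keeping the most extreme tokens at each end. Whether this page computes them exactly that way is an assumption. The sketch below shows the idea on a tiny synthetic vocabulary; the token names, vectors, and function names are all hypothetical and unrelated to the values listed here.

```python
import heapq


def logit_effects(decoder_dir, unembed):
    """Dot the feature's decoder direction with each token's unembedding vector."""
    return {
        token: sum(d * u for d, u in zip(decoder_dir, vec))
        for token, vec in unembed.items()
    }


def top_and_bottom(effects, k):
    """Return the k most boosted and the k most suppressed tokens."""
    top = heapq.nlargest(k, effects.items(), key=lambda kv: kv[1])
    bottom = heapq.nsmallest(k, effects.items(), key=lambda kv: kv[1])
    return top, bottom


# Synthetic two-dimensional example, not real model weights.
decoder_dir = [1.0, -0.5]
unembed = {
    "tok_a": [1.0, 0.0],
    "tok_b": [0.0, 1.0],
    "tok_c": [-1.0, 1.0],
    "tok_d": [0.5, 0.5],
}

effects = logit_effects(decoder_dir, unembed)
positive, negative = top_and_bottom(effects, k=2)
```

Here `positive` plays the role of the positive logits list and `negative` the role of the negative logits list, each ordered from most to least extreme.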

Activations Density 7.560%
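To put the density figure in concrete terms, the arithmetic below combines it with the dashboard prompt counts given in the configuration (12,288 prompts, 128 tokens each). It assumes that "Activations Density" means the fraction of dashboard token positions on which the feature has a nonzero activation; the page does not define the term, so this reading is an assumption.

```python
N_PROMPTS = 12_288          # dashboard prompts, from the configuration
TOKENS_PER_PROMPT = 128     # tokens per prompt, from the configuration
DENSITY = 7.560 / 100       # "Activations Density" as a fraction

# Total token positions covered by the dashboard prompts.
total_tokens = N_PROMPTS * TOKENS_PER_PROMPT

# Estimated number of positions where the feature fires, under the
# assumed definition of density.
approx_active_tokens = round(total_tokens * DENSITY)

print(f"total token positions: {total_tokens:,}")
print(f"estimated active positions: {approx_active_tokens:,}")
```

Under that reading the feature would fire on roughly 119,000 of about 1.57 million token positions, which is fairly dense for a single feature.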

No Known Activations
