INDEX

Explanations

mentions of the word "Est" and variations indicating a specific focus or title

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Configuration

neuronpedia/gpt2-small__res_slefr-ajt/2-res_slefr-ajt

Prompts (Dashboard)

12,288 prompts, 128 tokens each

Dataset (Dashboard)

Skylion007/openwebtext

Features

46,080

Data Type

torch.float32

Hook Point

blocks.2.hook_resid_pre

Architecture

standard

Context Size

128

Dataset

apollo-research/Skylion007-openwebtext-tokenizer-gpt2

Hook Point Layer

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

bur

-0.86

strap

-0.84

SpaceEngineers

-0.80

BALL

-0.80

FACE

-0.80

spr

-0.79

beit

-0.78

die

-0.74

 Crusade

-0.73

trap

-0.73

POSITIVE LOGITS

ablished

1.27

imate

1.24

ablish

1.22

imates

1.19

asonic

1.00

uary

1.00

Est

0.96

imating

0.92

imize

0.90

Est

0.87

Activations Density 0.011%

mentions of the word "Est" and variations indicating a specific focus or title

No Comments

No Known Activations