Explanations

adjectives and descriptors related to quality and appearance

oai_token-act-pair · gpt-4o-mini · Triggered by @bot

Configuration

neuronpedia/gpt2-small__res_slefr-ajt/2-res_slefr-ajt

Prompts (Dashboard): 12,288 prompts, 128 tokens each
Dataset (Dashboard): Skylion007/openwebtext
Features: 46,080
Data Type: torch.float32
Hook Point: blocks.2.hook_resid_pre
Architecture: standard
Context Size: 128
Dataset: apollo-research/Skylion007-openwebtext-tokenizer-gpt2
Hook Point Layer: 2
Activation Function: relu
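As a rough illustration of what this configuration describes, the sketch below assumes that a "standard" architecture with a relu activation means a single-layer sparse autoencoder: subtract a decoder bias, apply a linear encoder, pass the result through ReLU, then decode linearly. That reading is an assumption, not something the page states. The function names (`encode`, `decode`) and the toy dimensions are hypothetical. The real dictionary would map the 768-wide GPT-2 small residual stream at blocks.2.hook_resid_pre to the 46,080 features listed above.

```python
import random

D_MODEL = 768         # residual stream width of GPT-2 small
N_FEATURES = 46_080   # dictionary size listed in the configuration
EXPANSION = N_FEATURES // D_MODEL  # 60x expansion over the residual width


def relu(values):
    """Element-wise ReLU, the activation function named in the configuration."""
    return [max(0.0, v) for v in values]


def encode(x, w_enc, b_enc, b_dec):
    """Assumed encoder: relu((x - b_dec) @ W_enc + b_enc).

    w_enc holds one weight vector (length d_in) per feature.
    """
    centered = [xi - bi for xi, bi in zip(x, b_dec)]
    pre_acts = [
        sum(c * w for c, w in zip(centered, feature_weights)) + bias
        for feature_weights, bias in zip(w_enc, b_enc)
    ]
    return relu(pre_acts)


def decode(acts, w_dec, b_dec):
    """Assumed decoder: acts @ W_dec + b_dec.

    w_dec holds one output direction (length d_in) per feature.
    """
    recon = list(b_dec)
    for act, direction in zip(acts, w_dec):
        if act == 0.0:
            continue  # inactive features contribute nothing
        for j, d in enumerate(direction):
            recon[j] += act * d
    return recon


# Toy dimensions so the sketch runs instantly; the weights are random, not the real SAE.
rng = random.Random(0)
toy_d_in, toy_n_features = 4, 6
w_enc = [[rng.gauss(0, 1) for _ in range(toy_d_in)] for _ in range(toy_n_features)]
w_dec = [[rng.gauss(0, 1) for _ in range(toy_d_in)] for _ in range(toy_n_features)]
b_enc = [0.0] * toy_n_features
b_dec = [0.0] * toy_d_in

x = [rng.gauss(0, 1) for _ in range(toy_d_in)]
feature_acts = encode(x, w_enc, b_enc, b_dec)
reconstruction = decode(feature_acts, w_dec, b_dec)
```

With trained weights, most entries of `feature_acts` would be zero for any given token, which is what a low activation density reflects.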

Negative Logits

Token         Logit
"orio"        -0.52
"ĪĴ"          -0.49
"ooter"       -0.47
"pmwiki"      -0.45
"©¶æ"         -0.44
"initions"    -0.43
"olars"       -0.42
"heading"     -0.40
"BACK"        -0.40
"occupied"    -0.40

Positive Logits

Token         Logit
" Atmosp"     0.44
" Seym"       0.44
" Brewer"     0.38
" Hawk"       0.38
" Fresh"      0.37
"tis"         0.37
" Prosper"    0.36
" Mell"       0.36
" Norn"       0.36
"helle"       0.36

Activations Density 0.007%
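To put the density figure in perspective, the short calculation below estimates how many token positions a 0.007% density corresponds to. It assumes the density was measured over the dashboard prompt set (12,288 prompts of 128 tokens each), which the page does not confirm. The helper `activation_density` is a hypothetical name showing one common way such a figure is computed: the fraction of token positions where the feature is non-zero.

```python
N_PROMPTS = 12_288        # dashboard prompts listed in the configuration
TOKENS_PER_PROMPT = 128   # context size
DENSITY_PERCENT = 0.007   # activation density shown on the page

# Total token positions in the dashboard prompt set.
total_tokens = N_PROMPTS * TOKENS_PER_PROMPT

# Approximate number of positions where the feature fires, under the
# assumption that the density was computed over this same token set.
expected_active = total_tokens * DENSITY_PERCENT / 100


def activation_density(activations):
    """Fraction of token positions with a non-zero (positive) activation."""
    if not activations:
        return 0.0
    return sum(1 for a in activations if a > 0) / len(activations)


# Toy usage: one active position out of eight.
toy_density = activation_density([0.0, 0.0, 1.3, 0.0, 0.0, 0.0, 0.0, 0.0])

print(f"{total_tokens:,} tokens, about {expected_active:.0f} active positions")
```

Under that assumption the feature would fire on roughly 110 of about 1.57 million token positions, so it is very sparse.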

No Known Activations