INDEX

Explanations

references to individuals' names and identities

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Configuration

ctigges/pythia-70m-deduped__mlp-sm_processed/2-mlp-sm

Prompts (Dashboard)

32,768 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

32,768

Data Type

torch.float32

Hook Name

blocks.2.hook_mlp_out

Hook Layer

Architecture

standard

Context Size

128

Dataset

EleutherAI/the_pile_deduplicated

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

**]{}

-1.85

:**]{}

-1.69

¼

-1.69

”,

-1.61

»¿

-1.61

½

-1.58

Ĥ¬

-1.57

”—

-1.55

”

-1.55

**]{},

-1.54

POSITIVE LOGITS

hip

2.18

imes

1.73

yy

1.68

ially

1.68

hips

1.59

specifically

1.58

after

1.52

idades

1.51

rs

1.51

obs

1.51

Activations Density 0.492%

references to individuals' names and identities

No Comments

No Known Activations

references to individuals' names and identities

No Comments

No Known Activations