INDEX

Explanations

references to the Federal Reserve

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Configuration

ctigges/pythia-70m-deduped__mlp-sm_processed/0-mlp-sm

Prompts (Dashboard)

32,768 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

32,768

Data Type

torch.float32

Hook Name

blocks.0.hook_mlp_out

Hook Layer

Architecture

standard

Context Size

128

Dataset

EleutherAI/the_pile_deduplicated

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

lasses

-1.58

 fair

-1.55

 merry

-1.43

amais

-1.42

 merc

-1.40

'?"

-1.40

|$.

-1.38

'>

-1.38

 sess

-1.37

hab

-1.37

POSITIVE LOGITS

icum

2.06

erd

1.92

bank

1.84

etable

1.66

ilee

1.63

craft

1.60

fecture

1.59

forge

1.59

IPE

1.58

encial

1.56

Activations Density 0.432%

references to the Federal Reserve

No Comments

No Known Activations

references to the Federal Reserve

No Comments

No Known Activations