INDEX

Explanations

instances of the word "there."

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Configuration

ctigges/pythia-70m-deduped__mlp-sm_processed/0-mlp-sm

Prompts (Dashboard)

32,768 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

32,768

Data Type

torch.float32

Hook Name

blocks.0.hook_mlp_out

Hook Layer

Architecture

standard

Context Size

128

Dataset

EleutherAI/the_pile_deduplicated

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

abled

-1.63

 fewer

-1.59

'?"

-1.52

,''

-1.51

"?"

-1.45

amps

-1.44

eric

-1.43

"--

-1.42

msgstr

-1.42

 Wrote

-1.41

POSITIVE LOGITS

holding

1.65

fal

1.45

SUCCESS

1.39

PING

1.39

pping

1.38

front

1.34

uri

1.32

filling

1.31

holder

1.31

 disposable

1.30

Activations Density 0.549%

instances of the word "there."

No Comments

No Known Activations

instances of the word "there."

No Comments

No Known Activations