INDEX

Explanations

instances of the word "innovative."

oai_token-act-pair · gpt-4o-mini Triggered by @bot

Configuration

ctigges/pythia-70m-deduped__mlp-sm_processed/0-mlp-sm

Prompts (Dashboard)

32,768 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

32,768

Data Type

torch.float32

Hook Name

blocks.0.hook_mlp_out

Hook Layer

0

Architecture

standard

Context Size

128

Dataset

EleutherAI/the_pile_deduplicated

Activation Function

relu
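The configuration above describes a sparse autoencoder with a ReLU activation reading from `blocks.0.hook_mlp_out`. A minimal sketch of what a "standard" ReLU autoencoder forward pass usually looks like follows. It uses toy dimensions rather than the real 32,768 features, random weights rather than the trained ones, and it assumes the decoder bias is subtracted before encoding, which is a common convention but is not confirmed by this page. The names `encode`, `decode`, `W_enc`, `W_dec`, `b_enc`, and `b_dec` are illustrative.

```python
import random


def relu(values):
    """Element-wise ReLU, the activation function named in the configuration."""
    return [max(0.0, v) for v in values]


def encode(x, W_enc, b_enc, b_dec):
    """Map an activation vector x (length d_in) to feature activations (length d_sae).

    W_enc is stored as d_sae rows of length d_in (hypothetical layout).
    Subtracting b_dec first is an assumption, not something the page states.
    """
    centered = [xi - bi for xi, bi in zip(x, b_dec)]
    pre_acts = [
        sum(w * c for w, c in zip(row, centered)) + b
        for row, b in zip(W_enc, b_enc)
    ]
    return relu(pre_acts)


def decode(features, W_dec, b_dec):
    """Reconstruct the activation vector as b_dec plus a weighted sum of decoder rows."""
    out = list(b_dec)
    for f, row in zip(features, W_dec):
        if f != 0.0:  # inactive features contribute nothing
            for j, w in enumerate(row):
                out[j] += f * w
    return out


# Toy sizes for illustration only; the page lists 32,768 features.
D_IN, D_SAE = 8, 32
rng = random.Random(0)
W_enc = [[rng.gauss(0.0, 0.5) for _ in range(D_IN)] for _ in range(D_SAE)]
W_dec = [[rng.gauss(0.0, 0.5) for _ in range(D_IN)] for _ in range(D_SAE)]
b_enc = [0.0] * D_SAE
b_dec = [0.0] * D_IN

x = [rng.gauss(0.0, 1.0) for _ in range(D_IN)]
features = encode(x, W_enc, b_enc, b_dec)
reconstruction = decode(features, W_dec, b_dec)
print(sum(1 for f in features if f > 0.0), "of", D_SAE, "toy features active")
```

In a trained autoencoder only a small fraction of features fire on any given token, which is what the activation density figure on this page summarizes for a single feature.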

Negative Logits

enson

-1.60

 shelf

-1.54

oto

-1.51

auer

-1.48

ene

-1.42

arden

-1.39

heimer

-1.37

.“

-1.33

 Prize

-1.32

atche

-1.30

Positive Logits

ł

2.51

¢

2.47

Īĺ

2.21

ķ

2.20

¯

2.12

ļ

2.07

Ŀ

2.04

ŀ

2.02

ĸ

2.02

Ħ

1.98
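The positive-logit entries above look like stray accented letters, but they may be partial-byte tokens. The sketch below assumes the dashboard shows tokens using the GPT-2 style byte-to-character table that byte-level BPE tokenizers use for display; that assumption is not stated on this page. Under it, each displayed character maps back to one raw byte, and the helper names `bytes_to_unicode` and `token_to_bytes` are illustrative.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2 style table mapping each byte to a printable character."""
    printable = (
        list(range(0x21, 0x7F))     # '!' .. '~'
        + list(range(0xA1, 0xAD))   # U+00A1 .. U+00AC
        + list(range(0xAE, 0x100))  # U+00AE .. U+00FF
    )
    mapping = {b: chr(b) for b in printable}
    shift = 0
    for b in range(256):
        if b not in mapping:
            # Non-printable bytes are shifted into the range starting at U+0100.
            mapping[b] = chr(256 + shift)
            shift += 1
    return mapping


CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def token_to_bytes(token):
    """Convert a displayed token string back to the raw bytes it stands for."""
    return bytes(CHAR_TO_BYTE[ch] for ch in token)


# Tokens copied from the positive-logits list on this page.
POSITIVE_TOKENS = ["ł", "¢", "Īĺ", "ķ", "¯", "ļ", "Ŀ", "ŀ", "ĸ", "Ħ"]

for token in POSITIVE_TOKENS:
    raw = token_to_bytes(token)
    is_continuation = all(0x80 <= b <= 0xBF for b in raw)
    print(repr(token), raw.hex(), "UTF-8 continuation bytes:", is_continuation)
```

If the assumption holds, every entry decodes to bytes in the range 0x80 to 0xBF, which are UTF-8 continuation bytes, so the feature would be promoting fragments of multi-byte characters rather than Latin letters.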

Activations Density 0.019%
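To put the density figure in context, the short calculation below converts it to an approximate token count. It assumes the density was measured over the dashboard prompt set listed in the configuration (32,768 prompts of 128 tokens each); the page does not say which token set the percentage refers to, so treat the result as a rough estimate.

```python
NUM_PROMPTS = 32_768       # from "Prompts (Dashboard)"
TOKENS_PER_PROMPT = 128    # from "Context Size"
DENSITY_PERCENT = 0.019    # from "Activations Density"

# Total token positions in the dashboard prompt set.
total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT

# Expected number of positions where the feature is non-zero,
# assuming the density applies to this same token set.
expected_active = total_tokens * DENSITY_PERCENT / 100

print(f"{total_tokens:,} tokens, about {round(expected_active)} with non-zero activation")
```

Under that assumption the feature fires on roughly 797 of about 4.2 million token positions, which fits a feature tied to a single uncommon word.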
