INDEX

Explanations

references to wrestling and combat sports

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Configuration

ctigges/pythia-70m-deduped__mlp-sm_processed/0-mlp-sm

Prompts (Dashboard)

32,768 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

32,768

Data Type

torch.float32

Hook Name

blocks.0.hook_mlp_out

Hook Layer

Architecture

standard

Context Size

128

Dataset

EleutherAI/the_pile_deduplicated

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ago

-1.73

zes

-1.51

aging

-1.49

orie

-1.48

utt

-1.47

 reviewing

-1.40

 bowel

-1.40

anus

-1.40

quiry

-1.38

agers

-1.36

POSITIVE LOGITS

nuts

2.06

doms

1.95

)|$(

1.75

hurst

1.68

nut

1.63

place

1.58

pole

1.55

gered

1.53

à°¿

1.51

 temples

1.51

Activations Density 0.025%

references to wrestling and combat sports

No Comments

No Known Activations

references to wrestling and combat sports

No Comments

No Known Activations