Explanations

phrases focusing on ease of use and simplifying complex processes

oai_token-act-pair · gpt-4o-mini Triggered by @bot

Configuration

ctigges/pythia-70m-deduped__res-sm_processed/4-res-sm

Prompts (Dashboard): 32,768 prompts, 128 tokens each
Dataset (Dashboard): monology/pile-uncopyrighted
Features: 32,768
Data Type: torch.float32
Hook Name: blocks.4.hook_resid_post
Hook Layer: 4
Architecture: standard
Context Size: 128
Dataset: EleutherAI/the_pile_deduplicated
Activation Function: relu
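
The configuration above describes a standard ReLU sparse autoencoder that reads the residual stream at blocks.4.hook_resid_post. A minimal sketch of what such an encoder and decoder compute follows. It assumes the common formulation f = ReLU((x - b_dec) W_enc + b_enc) and x_hat = f W_dec + b_dec, which this page does not confirm. The weights are random placeholders, and the sizes are toy values rather than the real 32,768 features.

```python
import random

def relu(values):
    """Elementwise ReLU, matching the 'relu' activation function listed above."""
    return [max(0.0, v) for v in values]

def matvec(x, W):
    """Multiply a row vector x (length n) by a matrix W (n rows, m columns)."""
    return [sum(x[i] * W[i][j] for i in range(len(x))) for j in range(len(W[0]))]

def sae_encode(x, W_enc, b_enc, b_dec):
    """Assumed standard encoder: subtract decoder bias, project, add bias, ReLU."""
    centered = [xi - bi for xi, bi in zip(x, b_dec)]
    pre_acts = [p + b for p, b in zip(matvec(centered, W_enc), b_enc)]
    return relu(pre_acts)

def sae_decode(features, W_dec, b_dec):
    """Assumed standard decoder: linear map back to the residual stream."""
    return [r + b for r, b in zip(matvec(features, W_dec), b_dec)]

# Toy sizes (hypothetical); the real SAE on this page has 32,768 features.
D_MODEL, N_FEATURES = 8, 32
rng = random.Random(0)
W_enc = [[rng.gauss(0, 0.3) for _ in range(N_FEATURES)] for _ in range(D_MODEL)]
W_dec = [[rng.gauss(0, 0.3) for _ in range(D_MODEL)] for _ in range(N_FEATURES)]
b_enc = [0.0] * N_FEATURES
b_dec = [0.0] * D_MODEL

x = [rng.gauss(0, 1) for _ in range(D_MODEL)]   # stand-in residual vector
features = sae_encode(x, W_enc, b_enc, b_dec)   # non-negative feature activations
reconstruction = sae_decode(features, W_dec, b_dec)
```

Each entry of `features` plays the role of one feature's activation on one token; the dashboard on this page summarizes a single such entry across many tokens.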

Negative Logits

ĻĤ    -5.54
Ħ     -5.28
İ     -5.05
¿½    -4.99
±     -4.89
¯     -4.87
£     -4.78
į     -4.75
»¿    -4.73
¸     -4.72
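
The glyphs in the negative-logit list look like the printable stand-ins that byte-level BPE tokenizers use for raw bytes. The sketch below maps them back to byte values, assuming the tokenizer follows the GPT-2 style byte-to-unicode table. That assumption is not stated on this page, and `token_to_bytes` is a hypothetical helper name.

```python
def bytes_to_unicode():
    """GPT-2 style table: printable bytes map to themselves, the rest to 256 + n."""
    printable = list(range(33, 127)) + list(range(161, 173)) + list(range(174, 256))
    byte_values = printable[:]
    code_points = printable[:]
    n = 0
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + n)
            n += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}

# Invert the table so a displayed glyph can be turned back into its byte.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}

def token_to_bytes(token):
    """Hypothetical helper: recover the raw bytes behind a displayed token."""
    return bytes(UNICODE_TO_BYTE[ch] for ch in token)

tokens = ["ĻĤ", "Ħ", "İ", "¿½", "±", "¯", "£", "į", "»¿", "¸"]
decoded = {t: token_to_bytes(t) for t in tokens}
for token, raw in decoded.items():
    print(repr(token), raw.hex())
```

Under that assumption, every byte recovered from this list falls in the range 0x80 to 0xBF, the range used for UTF-8 continuation bytes.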

Positive Logits

 easier       2.94
 safer        2.60
 harder       2.45
 attractive   2.39
 easy         2.36
 obsolete     2.30
 impossible   2.24
 difficult    2.24
 possible     2.24
 extremely    2.23
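
Logit lists like the two above are commonly obtained by projecting a feature's decoder direction through the model's unembedding matrix and sorting the result. Whether the values on this page were produced exactly that way is an assumption. The sketch below uses a tiny made-up vocabulary and made-up matrices, so its numbers are illustrative only and do not reproduce the values listed here.

```python
def logit_effects(decoder_row, W_U):
    """Project one feature's decoder direction (length d_model) through W_U (d_model x vocab)."""
    vocab_size = len(W_U[0])
    return [
        sum(decoder_row[i] * W_U[i][v] for i in range(len(decoder_row)))
        for v in range(vocab_size)
    ]

def top_k(effects, vocab, k, largest=True):
    """Return the k tokens with the largest (or smallest) logit effect."""
    order = sorted(range(len(effects)), key=lambda v: effects[v], reverse=largest)
    return [(vocab[v], round(effects[v], 2)) for v in order[:k]]

# Hypothetical toy data: 2-dimensional residual stream, 4-token vocabulary.
vocab = [" easier", " safer", " the", "Ħ"]
decoder_row = [1.0, 0.5]
W_U = [
    [2.0, 1.5, 0.0, -3.0],
    [1.0, 1.0, 0.2, -2.0],
]

effects = logit_effects(decoder_row, W_U)
positive = top_k(effects, vocab, k=2, largest=True)
negative = top_k(effects, vocab, k=1, largest=False)
print(positive)
print(negative)
```

Read this way, a positive entry is a token the feature pushes the model toward predicting, and a negative entry is a token it pushes away from.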

Activations Density 0.664%
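
Activation density is usually the fraction of tokens on which a feature fires at all. The sketch below assumes that definition, with "fires" meaning an activation above zero, and assumes the figure was measured over the 32,768 dashboard prompts of 128 tokens each. Neither assumption is confirmed on this page.

```python
def activation_density(activations):
    """Fraction of tokens with a strictly positive activation.

    `activations` is a list of prompts, each a list of per-token activations.
    """
    total = 0
    active = 0
    for prompt in activations:
        for value in prompt:
            total += 1
            if value > 0:
                active += 1
    return active / total if total else 0.0

# Scale implied by the dashboard settings, under the assumptions stated above.
N_PROMPTS = 32_768
TOKENS_PER_PROMPT = 128
total_tokens = N_PROMPTS * TOKENS_PER_PROMPT          # 4,194,304 tokens
approx_active = round(total_tokens * 0.664 / 100)     # roughly 27,850 tokens

# Tiny worked example: one active token out of four.
example_density = activation_density([[0.0, 1.2], [0.0, 0.0]])
print(total_tokens, approx_active, example_density)
```

If those assumptions hold, a density of 0.664% corresponds to the feature being active on roughly 27,850 of about 4.2 million tokens.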
