Explanations

terms related to personal ownership and entities

oai_token-act-pair · gpt-4o-mini Triggered by @bot

Configuration

ctigges/pythia-70m-deduped__res-sm_processed/4-res-sm

Prompts (Dashboard)

32,768 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

32,768

Data Type

torch.float32

Hook Name

blocks.4.hook_resid_post

Hook Layer

4

Architecture

standard

Context Size

128

Dataset

EleutherAI/the_pile_deduplicated

Activation Function

relu
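
The configuration above describes a sparse autoencoder (SAE) with a ReLU activation, read from `blocks.4.hook_resid_post`. As a rough illustration only, the sketch below shows what a ReLU SAE forward pass commonly looks like. The weight names (`W_enc`, `W_dec`, `b_enc`, `b_dec`), the subtraction of `b_dec` before encoding, and the toy dimensions are all assumptions made for this example; the page does not specify them.

```python
import random

# Values copied from the configuration above.
CONFIG = {
    "hook_name": "blocks.4.hook_resid_post",
    "n_features": 32768,
    "context_size": 128,
    "activation_fn": "relu",
}

def relu(x):
    return x if x > 0.0 else 0.0

def encode(x, W_enc, b_enc, b_dec):
    """Feature activations: relu((x - b_dec) @ W_enc + b_enc).

    Subtracting b_dec first is an assumption; some SAEs skip this step.
    """
    centered = [xi - bi for xi, bi in zip(x, b_dec)]
    n_features = len(b_enc)
    return [
        relu(sum(c * W_enc[i][j] for i, c in enumerate(centered)) + b_enc[j])
        for j in range(n_features)
    ]

def decode(f, W_dec, b_dec):
    """Reconstruction of the input vector: f @ W_dec + b_dec."""
    d_model = len(b_dec)
    return [
        sum(fj * W_dec[j][i] for j, fj in enumerate(f)) + b_dec[i]
        for i in range(d_model)
    ]

# Toy sizes (hypothetical) so the sketch runs instantly;
# the SAE on this page has 32,768 features.
random.seed(0)
D_MODEL, N_FEATURES = 4, 8
W_enc = [[random.uniform(-1, 1) for _ in range(N_FEATURES)] for _ in range(D_MODEL)]
W_dec = [[random.uniform(-1, 1) for _ in range(D_MODEL)] for _ in range(N_FEATURES)]
b_enc = [0.0] * N_FEATURES
b_dec = [0.0] * D_MODEL

x = [0.5, -1.0, 0.25, 2.0]  # stand-in for one residual-stream vector
acts = encode(x, W_enc, b_enc, b_dec)
recon = decode(acts, W_dec, b_dec)
```

Each entry of `acts` corresponds to one feature; the feature shown on this page would be a single index in that list.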

Negative Logits

ĥ½

-3.60

¦

-3.57

ĸ

-3.50

ĳ

-3.50

¶

-3.47

©

-3.43

↵

-3.41

<|padding|>

-3.41

<|outofrange|>

-3.41

↵

-3.41

Positive Logits

 reasons

1.27

 explanation

1.27

 custody

1.24

uous

1.23

ariate

1.22

olia

1.21

estones

1.17

 consideration

1.17

 oldest

1.13

ensive

1.12
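
Logit lists like the ones above are commonly produced by projecting a feature's decoder direction through the model's unembedding matrix and ranking tokens by the result. The page does not state how its values were computed, so the following is a minimal sketch under that assumption. The names `decoder_row` and `W_U` and all numbers are hypothetical toy values, not the logits shown above, and real dashboards may apply extra normalization.

```python
def logit_effects(decoder_row, W_U):
    """Dot the feature's decoder direction with each unembedding column."""
    vocab_size = len(W_U[0])
    return [
        sum(d * W_U[i][t] for i, d in enumerate(decoder_row))
        for t in range(vocab_size)
    ]

def top_and_bottom(effects, vocab, k):
    """Return the k most positive and k most negative (token, effect) pairs."""
    ranked = sorted(zip(vocab, effects), key=lambda pair: pair[1])
    return ranked[-k:][::-1], ranked[:k]

# Hypothetical toy setup: 2-dimensional residual stream, 4-token vocabulary.
decoder_row = [1.0, -0.5]
vocab = [" reasons", " custody", "<|padding|>", "uous"]
W_U = [
    [1.0, 0.8, -2.0, 0.5],
    [-0.5, -0.8, 3.0, 0.0],
]

effects = logit_effects(decoder_row, W_U)
positive, negative = top_and_bottom(effects, vocab, k=2)
```

Here `positive` lists the tokens the feature pushes up most strongly and `negative` the tokens it pushes down.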

Activations Density 1.294%
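
One way to read the density figure, assuming it is the fraction of token positions where the feature's activation is above zero, is sketched below. The helper `activation_density` and the toy activations are hypothetical. Applying the percentage to the 32,768 dashboard prompts of 128 tokens each is also an assumption, since the page does not say which token set the density was measured on.

```python
def activation_density(activations):
    """Fraction of token positions where the feature activation is > 0."""
    flat = [a for prompt in activations for a in prompt]
    return sum(1 for a in flat if a > 0.0) / len(flat)

# Toy example: 2 prompts of 4 tokens, one active position.
toy = [[0.0, 0.0, 1.3, 0.0], [0.0, 0.0, 0.0, 0.0]]
density = activation_density(toy)

# If the 1.294% figure were measured over the dashboard prompts,
# it would correspond to roughly this many active token positions.
total_tokens = 32768 * 128
approx_active = round(total_tokens * 0.01294)
```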

No Known Activations
