INDEX

Explanations

references to "CO," indicating affiliations with companies or corporations

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Configuration

ctigges/pythia-70m-deduped__mlp-sm_processed/2-mlp-sm

Prompts (Dashboard)

32,768 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

32,768

Data Type

torch.float32

Hook Name

blocks.2.hook_mlp_out

Hook Layer

Architecture

standard

Context Size

128

Dataset

EleutherAI/the_pile_deduplicated

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

enez

-2.11

enos

-2.03

itol

-1.87

enne

-1.86

enant

-1.80

iÃ¨re

-1.73

ienn

-1.72

ento

-1.69

ians

-1.69

ienne

-1.68

POSITIVE LOGITS

 pore

1.46

ths

1.45

 [...]

1.38

 Physical

1.37

ations

1.36

 acute

1.36

 thoughts

1.36

 pathological

1.35

 Majesty

1.34

 seriously

1.32

Activations Density 0.297%

references to "CO," indicating affiliations with companies or corporations

No Comments

No Known Activations

references to "CO," indicating affiliations with companies or corporations

No Comments

No Known Activations