Explanations

references to children and their well-being

oai_token-act-pair · gpt-4o-mini Triggered by @bot

Configuration

ctigges/pythia-70m-deduped__mlp-sm_processed/2-mlp-sm

Prompts (Dashboard): 32,768 prompts, 128 tokens each
Dataset (Dashboard): monology/pile-uncopyrighted
Features: 32,768
Data Type: torch.float32
Hook Name: blocks.2.hook_mlp_out
Hook Layer: 2
Architecture: standard
Context Size: 128
Dataset: EleutherAI/the_pile_deduplicated
Activation Function: relu
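
The configuration above describes a sparse autoencoder with a ReLU activation that reads from blocks.2.hook_mlp_out. As a minimal sketch of what such an encoder commonly computes, the snippet below assumes the usual form relu((x - b_dec) @ W_enc + b_enc). The function name sae_encode, the random placeholder weights, and the toy feature count are illustrative assumptions, not the trained parameters of this SAE. The input width of 512 matches the pythia-70m residual and MLP output size.

```python
import numpy as np

def sae_encode(x, W_enc, b_enc, b_dec):
    """Feature activations of a ReLU sparse autoencoder (assumed standard form)."""
    pre = (x - b_dec) @ W_enc + b_enc   # pre-activations, one column per feature
    return np.maximum(pre, 0.0)         # ReLU keeps only positive pre-activations

rng = np.random.default_rng(0)
d_in, n_features = 512, 1024            # toy width; the dashboard's SAE has 32,768 features
W_enc = rng.standard_normal((d_in, n_features), dtype=np.float32) * 0.02  # placeholder weights
b_enc = np.zeros(n_features, dtype=np.float32)
b_dec = np.zeros(d_in, dtype=np.float32)

x = rng.standard_normal((128, d_in), dtype=np.float32)  # stand-in for one 128-token context
acts = sae_encode(x, W_enc, b_enc, b_dec)
print(acts.shape, acts.dtype)
```

With trained weights, column i of acts would hold the per-token activation of feature i, which is the quantity a dashboard like this one summarizes.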

Negative Logits

Token   Logit
ı       -3.11
¾       -3.01
Ļ       -3.00
º       -2.96
«       -2.94
Ľ       -2.93
Į       -2.91
İ       -2.84
ģ       -2.79
³       -2.79

Positive Logits

Token      Logit
forall     1.52
substack   1.52
\$         1.50
reon       1.49
valence    1.47
rl         1.45
retched    1.45
|}\        1.44
 Allah     1.42
compare    1.41

Activations Density 0.451%
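
To give the density figure a sense of scale, the short calculation below assumes that density means the fraction of tokens on which the feature is active, and that it was measured over the 32,768 dashboard prompts of 128 tokens each. Both are assumptions about how the number was derived; the page itself does not say.

```python
n_prompts, context_size = 32_768, 128
density = 0.451 / 100                     # 0.451% expressed as a fraction

total_tokens = n_prompts * context_size   # tokens across all dashboard prompts
expected_active = round(total_tokens * density)
print(total_tokens, expected_active)      # 4194304 18916
```

Under those assumptions the feature would fire on roughly 19 thousand of about 4.2 million tokens.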

references to children and their well-being

No Known Activations
