INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

ctigges/pythia-70m-deduped__mlp-sm_processed/0-mlp-sm

Prompts (Dashboard)

32,768 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

32,768

Data Type

torch.float32

Hook Name

blocks.0.hook_mlp_out

Hook Layer

Architecture

standard

Context Size

128

Dataset

EleutherAI/the_pile_deduplicated

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

dll

-1.82

Ã¨re

-1.80

kwargs

-1.65

dock

-1.60

 enough

-1.53

tk

-1.53

StackTrace

-1.49

Ã¨res

-1.47

panic

-1.43

 fatal

-1.42

POSITIVE LOGITS

ament

1.58

 Balance

1.57

idity

1.55

liness

1.48

 rating

1.48

 worn

1.48

 measuring

1.47

balance

1.45

ighed

1.44

 percentage

1.44

Activations Density 3.048%

No Known Activations

This feature has no known activations.