Explanations

prime followed by specific terms

The neuron triggers primarily on occurrences of the word “Prime.”

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

" lowa"  -0.83
" complying"  -0.81
"↵↵"  -0.77
" mites"  -0.76
"érèse"  -0.76
" amplify"  -0.75
" broaden"  -0.75
"IFR"  -0.75
" mics"  -0.74
" verfügen"  -0.73

Positive Logits

" mover"  1.59
"Prime"  1.52
" factorization"  1.49
" minister"  1.46
" Prime"  1.42
"val"  1.40
"prime"  1.39
" Minister"  1.38
" prime"  1.23
"Minister"  1.23

Activations Density 0.013%
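
To give the density figure a sense of scale, the sketch below converts it into an approximate token count using the prompt configuration listed above. It assumes that "activation density" means the share of dashboard tokens on which the feature has a nonzero activation. The page does not state this definition, so the result is a rough estimate only. The variable names are illustrative.

```python
# Figures taken from the dashboard configuration and density line.
PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.013  # reported value, already rounded

# Total tokens the dashboard was computed over.
total_tokens = PROMPTS * TOKENS_PER_PROMPT

# Assumption: density = (tokens with nonzero activation) / (total tokens).
approx_active_tokens = round(total_tokens * DENSITY_PERCENT / 100)

print(f"total tokens: {total_tokens:,}")
print(f"approx. active tokens: {approx_active_tokens:,}")
```

Under that assumption, the feature fires on roughly 400 of about 3.1 million tokens. Because the reported percentage is rounded, the exact count could differ by a few dozen.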