INDEX

Explanations

cial

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 pleaſure

-1.06

 itſelf

-0.98

 Monfieur

-0.94

 Jefus

-0.89

?),

-0.89

ſy

-0.87

 Majefty

-0.86

?).

-0.84

 ſche

-0.84

 muſt

-0.83

POSITIVE LOGITS

be

0.67

 continue

0.60

 prepare

0.59

awaiter

0.57

 assist

0.55

 manage

0.55

 support

0.52

 help

0.52

not

0.51

 even

0.51

Activations Density 0.011%