INDEX

Explanations

negative situations

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

DockStyle

-0.93

WireFormatLite

-0.91

 متعلقه

-0.87

出版年

-0.78

 Biôgrafia

-0.75

ergies

-0.71

IsMutable

-0.71

 autorytatywna

-0.69

 distanciation

-0.69

 समीक्षक

-0.67

POSITIVE LOGITS

or

0.73

 negative

0.59

but

0.57

 caused

0.57

 causing

0.56

 worse

0.55

yet

0.54

 dangerous

0.49

,'

0.49

--

0.48

Activations Density 0.033%