INDEX

Explanations

neither

Configuration

Prompts (Dashboard): 16,384 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

"]--;"               -0.70
" EconPapers"        -0.61
"yscy"               -0.58
" nahilalakip"       -0.58
" otomatig"          -0.57
"parsedMessage"      -0.56
"θρω"                -0.54
"fjspx"              -0.54
" mosso"             -0.53
"RectangleBorder"    -0.53

Positive Logits

"is"                 0.54
"past"               0.52
"do"                 0.52
"are"                0.51
"NewRow"             0.50
" infallib"          0.49
" negar"             0.48
"dor"                0.45
"boards"             0.45
"wegs"               0.45

Activations Density: 0.002%
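
To give a feel for how sparse this is, the sketch below converts the density into an approximate token count. It assumes, and this is only an assumption, that the density is the fraction of all dashboard tokens (16,384 prompts of 128 tokens each) on which the feature has a nonzero activation, and that the displayed 0.002% is a rounded value. The function name `expected_active_tokens` is hypothetical, chosen for illustration.

```python
# Figures copied from the dashboard configuration on this page.
NUM_PROMPTS = 16_384
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.002  # displayed value; presumably rounded


def expected_active_tokens(num_prompts, tokens_per_prompt, density_percent):
    """Return (total tokens, approximate number of activating tokens).

    Assumes density is the share of tokens with a nonzero activation.
    """
    total = num_prompts * tokens_per_prompt
    return total, total * density_percent / 100.0


total_tokens, active = expected_active_tokens(
    NUM_PROMPTS, TOKENS_PER_PROMPT, DENSITY_PERCENT
)
print(total_tokens)   # size of the dashboard sample in tokens
print(round(active))  # rough count of tokens where the feature fires
```

Under these assumptions the feature fires on only a few dozen of roughly two million tokens, so the logit lists above rest on a very small number of activating examples.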