INDEX

Explanations

if

Configuration

Prompts (Dashboard): 16,384 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Token              Logit
"AnchorStyles"     -0.83
"antaranya"        -0.81
"Datuak"           -0.81
"horabuena"        -0.79
" bezeichneter"    -0.75
" Мексичка"        -0.74
"ParallelGroup"    -0.73
" فريبيس"          -0.73
" BorderRadius"    -0.73
" poveznice"       -0.73

Positive Logits

Token              Logit
"an"               0.75
(blank)            0.74
"the"              0.74
" there"           0.65
" needed"          0.60
" left"            0.60
"one"              0.59
" little"          0.58
"not"              0.58
" patients"        0.57

Activations Density 0.067%
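
To put the density figure in context, the short sketch below converts it into an approximate count of token positions. It assumes that "Activations Density" means the fraction of all dashboard token positions on which the feature has a non-zero activation, and that the density was measured over the full dashboard prompt set listed under Configuration. Neither assumption is stated on this page, and the function name is hypothetical.

```python
# Rough arithmetic only. Assumes density = fraction of dashboard token
# positions with non-zero activation (an assumption, not stated on the page).

NUM_PROMPTS = 16_384        # from "Prompts (Dashboard)"
TOKENS_PER_PROMPT = 128     # from "Prompts (Dashboard)"
DENSITY_PERCENT = 0.067     # from "Activations Density"


def approx_active_tokens(num_prompts: int, tokens_per_prompt: int,
                         density_percent: float) -> tuple[int, int]:
    """Return (total token positions, approximate active positions)."""
    total = num_prompts * tokens_per_prompt
    active = round(total * density_percent / 100)
    return total, active


total_tokens, active_tokens = approx_active_tokens(
    NUM_PROMPTS, TOKENS_PER_PROMPT, DENSITY_PERCENT
)
print(f"total token positions: {total_tokens:,}")
print(f"approx. active positions: {active_tokens:,}")
```

Under these assumptions the feature fires on roughly 1,400 of about 2.1 million token positions. The published density is rounded, so the count is approximate.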