INDEX

Explanations

safety and reliability

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

(;;)

-0.78

Hauptartikel

-0.78

NUMX

-0.77

 Transfer

-0.75

Transfer

-0.74

inghouse

-0.73

()")

-0.71

ⓧ

-0.71

Datuak

-0.69

;;)

-0.69

POSITIVE LOGITS

ized

0.47

lar

0.43

ged

0.43

ed

0.42

 же

0.41

რო

0.40

docx

0.39

 TextAlign

0.38

 iprot

0.38

Activations Density 0.617%