INDEX

Explanations

"ban the box" or lists

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Genuine

0.54

New

0.53

 genuine

0.52

𝔬

0.51

Lie

0.50

genuine

0.49

Regular

0.48

𝙊

0.47

Only

0.46

𝘪

0.46

POSITIVE LOGITS

räume

0.45

rl

0.38

 Arth

0.38

Yao

0.38

Dua

0.37

랴

0.37

潸

0.37

 Enrollment

0.37

 thăm

0.36

Dey

0.36

Activations Density 0.000%