INDEX

Explanations

is followed by article or defined

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵

0.36

ξύ

0.33

↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵

0.32

 حالا

0.32

istor

0.31

 haters

0.31

 screenshot

0.30

</

0.30

疏

0.30

0.29

POSITIVE LOGITS

一种

0.64

een

0.61

一種

0.57

 eine

0.56

一款

0.52

 fundada

0.50

 einen

0.50

Located

0.49

 located

0.48

located

0.48

Activations Density 0.002%