INDEX

Explanations

by definition

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

.!

0.36

 ഒന്നും

0.35

гим

0.35

Demo

0.34

֒

0.34

 یہی

0.32

!“

0.32

 такие

0.31

 которые

0.31

+}^{

0.30

POSITIVE LOGITS

 importantly

0.68

 crucially

0.60

 frankly

0.58

 however

0.56

 conversely

0.56

 predictably

0.52

 quite

0.49

 ironically

0.49

 unsurprisingly

0.49

 additionally

0.49

Activations Density 0.051%