INDEX

Explanations

Okay, let's break down

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

很好的

0.24

 mixture

0.24

 кор

0.23

 veces

0.23

 kupu

0.23

പക്ഷ

0.23

 ballpark

0.23

pathetic

0.22

കം

0.22

䇞

0.22

POSITIVE LOGITS

Ř

0.25

 having

0.25

 accessing

0.24

 THIS

0.23

యాన్ని

0.23

 waarin

0.23

 iemand

0.22

ኖ

0.22

any

0.22

阎

0.22

Activations Density 0.127%