INDEX

Explanations

5 minutes or disclaimer

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

批量

0.46

ert

0.44

いません

0.43

gye

0.42

thew

0.42

 convexo

0.42

गिर

0.42

法施行

0.41

ｎ

0.41

स्

0.41

POSITIVE LOGITS

 разных

0.51

 Edith

0.48

фан

0.45

 Rhodes

0.43

rma

0.43

 Oatmeal

0.43

 Anita

0.42

 Revolutionary

0.42

чнее

0.42

 Bitte

0.42

Activations Density 0.004%