INDEX

Explanations

avoiding certain types of questions

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 annihilated

0.45

%,

0.41

䚰

0.41

ರವಾಗಿ

0.40

 入り

0.39

मर्द

0.38

iyas

0.38

iyam

0.37

ಧ

0.37

][%

0.37

POSITIVE LOGITS

 Upper

0.44

 నె

0.42

 Cocoa

0.42

 planters

0.40

ปรับ

0.37

 planter

0.37

 upper

0.35

 ప్రశ్

0.35

 Hospice

0.35

 Lower

0.35

Activations Density 0.002%