INDEX

Explanations

understanding negative situations

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 proffered

0.42

awaited

0.41

 gladly

0.39

лега

0.39

 Eater

0.39

styling

0.38

achtree

0.37

 hatten

0.36

≋

0.36

탓

0.36

POSITIVE LOGITS

 waste

0.40

Waste

0.39

 Waste

0.38

 affect

0.38

goo

0.37

 Alps

0.36

waste

0.36

 excessive

0.35

 এখানে

0.35

 болезни

0.35

Activations Density 0.000%