INDEX

Explanations

Concluding lists of options

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 mysticism

0.38

 allegory

0.38

𝔀

0.38

 sincerity

0.37

 erste

0.37

еме

0.37

 eagerness

0.36

ҳ

0.36

 estet

0.35

 aest

0.35

POSITIVE LOGITS

 These

0.59

 Lastly

0.57

 Finally

0.55

いずれ

0.55

 None

0.52

These

0.50

 इनमें

0.49

 finally

0.49

 どれ

0.48

 Choosing

0.48

Activations Density 0.412%