INDEX

Explanations

redundant or contrasting concepts

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Magazine

0.46

Currently

0.43

 mujeres

0.42

かしい

0.41

Ba

0.40

dre

0.40

hola

0.40

č

0.39

 Roberto

0.39

POSITIVE LOGITS

สำหรับการ

0.50

សម្រាប់ការ

0.50

 nerfs

0.48

য়োজনীয়

0.46

चर्स

0.43

 desks

0.43



0.43

 guarant

0.42

 liability

0.42

ไฮ

0.41

Activations Density 0.001%