INDEX

Explanations

evaluation outcomes: positive, negative, well, poorly

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ពិសេស

0.52

depth

0.52

aisia

0.49

}$.

0.48

 liters

0.48

იმე

0.48

theater

0.47

pagination

0.47

 bosom

0.47

})$.

0.46

POSITIVE LOGITS

стройство

0.58

的

0.57

any

0.55

 cualquier

0.55

 When

0.52

WHEN

0.52

ドライ

0.51

看到

0.50

Quando

0.50

сным

0.50

Activations Density 0.031%