INDEX

Explanations

explaining quality and its dependencies

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Quatre

0.49

狨

0.41

ربية

0.39

ранд

0.38

 Vide

0.38

 WATER

0.38

 Alpen

0.38

伈

0.38

 näher

0.37

 videomuz

0.37

POSITIVE LOGITS

some

0.53

you

0.51

only

0.50

iary

0.50

ing

0.50

you

0.49

 trivial

0.49

 attenuation

0.47

table

0.47

trivial

0.46

Activations Density 0.006%