INDEX

Explanations

hallucinations and LLM properties

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 portátil

0.48

 Swarovski

0.43

 cargos

0.43

 nascimento

0.43

achta

0.43

 líquido

0.42

vať

0.42

 neodymium

0.41

鐲

0.41

 vendidos

0.41

POSITIVE LOGITS

 describing

0.47

History

0.43

Needed

0.42

 HISTORY

0.42

Ghost

0.42

ظام

0.41

Comple

0.41

 protects

0.41

 chapter

0.40

 meaningfully

0.40

Activations Density 0.001%