INDEX

Explanations

identify unexpected findings

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 pasan

0.47

 حسین

0.47

 precisar

0.47

담

0.46

 spain

0.45

 reprendre

0.45

 richesse

0.45

 මෙ

0.45

 riqueza

0.44

◽

0.44

POSITIVE LOGITS

Webpack

0.51

glie

0.47

תו

0.46

 Single

0.46

 Weights

0.42

 Early

0.41

भारत

0.41

 Faster

0.41

Thời

0.41

Single

0.39

Activations Density 0.000%