INDEX

Explanations

for clarity, simplicity, demonstration

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ostino

0.70

olini

0.70

êng

0.66

大量

0.66

olulu

0.65

utama

0.64

解決

0.63

一系列

0.60

 পের

0.60

훌

0.60

POSITIVE LOGITS

 completeness

1.71

 sake

1.57

 clarity

1.47

 illustrative

1.44

 convenience

1.39

为了

1.31

 simplicity

1.31

 illustration

1.23

 demonstration

1.22

 为了

1.20

Activations Density 0.239%