INDEX

Explanations

introduces examples or general statements

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

lays

0.39

 crushes

0.38

 sells

0.37

 months

0.36

ɓ

0.36

icies

0.36

adies

0.36

收取

0.36

 kilograms

0.36

 billboards

0.35

POSITIVE LOGITS

实用

0.46

 सटीक

0.42

긔

0.41

옛

0.41

 Useful

0.40

 useful

0.40

 vrlo

0.39

 auquel

0.39

 Typical

0.39

 helpful

0.39

Activations Density 0.003%