INDEX

Explanations

overall proportion of correctness

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 rates

0.46

構成

0.42

rolog

0.40

ormous

0.40

៩

0.40

 coating

0.39

 substantial

0.39

ivity

0.38

 injury

0.38

 desorption

0.38

POSITIVE LOGITS

 څه

0.39

 arşiv

0.39

 व्हाट्सएप

0.38

 Overall

0.38

ঘা

0.38

 правильно

0.38

좁

0.38

Successfully

0.38

 एकदा

0.37

wins

0.37

Activations Density 0.006%