INDEX

Explanations

comparing models or methods

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

0.44

 subsequent

0.42

 subsequently

0.41

$\

0.40

 named

0.39

 indel

0.39

 naming

0.38

 lesion

0.38

$[\

0.38

 accolade

0.37

POSITIVE LOGITS

했지만

0.52

 originalmente

0.49

 nhưng

0.48

 даже

0.46

ພວກເຮ

0.46

 기술

0.45

 온도

0.45

iliśmy

0.45

Temperatura

0.45

 dovoljno

0.45

Activations Density 0.005%