INDEX

Explanations

promises and availability

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

疲

0.41

panel

0.40

そも

0.39

疑似

0.39

overleftarrow

0.39

 overcrow

0.38

})}

0.38

发现了

0.38

 দেখলে

0.38

😱

0.38

POSITIVE LOGITS

 promises

0.71

 promise

0.64

 promised

0.61

 Promises

0.57

 Promise

0.55

承诺

0.55

 обеща

0.55

promises

0.51

 anticipated

0.50

 वादा

0.50

Activations Density 0.021%