Explanations

apology and our

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Negative Logits

"かっこ"          0.47
" klassischen"    0.43
" kreativ"        0.43
" beeindruck"     0.42
"を実行"          0.42
" postulated"     0.42
"<0x00>"          0.41
" sinnvoll"       0.41
"𝕌"               0.41
"憧"              0.41

Positive Logits

" unfortunately"  0.59
" сожалению"      0.58
" apologize"      0.56
" apologies"      0.54
" हमारा"          0.53
" apologise"      0.53
" हमारी"          0.52
" apology"        0.52
"确实"            0.52
"our"             0.51

Activations Density 0.033%