INDEX

Explanations

documentation strings

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

$),

0.62

 supposedly

0.62

렸

0.61

'$,

0.61

ΠΑ

0.61

痠

0.60

 allegedly

0.59

|$,

0.59

$+

0.59

$,

0.58

POSITIVE LOGITS

robust

0.76

 গঠ

0.76

Much

0.75

ľ

0.74

Variant

0.73

 postdoc

0.73

Bunch

0.72

 Normalize

0.71

stamp

0.71

Brass

0.70

Activations Density 0.043%