INDEX

Explanations

assertions for equality and truthiness

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Henderson

0.41

 Daphne

0.40

ampoo

0.39

scape

0.39

 frances

0.39

 Franc

0.38

 Hall

0.38

颈

0.38

 संपूर्ण

0.38

الي

0.38

POSITIVE LOGITS

Equal

0.78

 equal

0.71

 égal

0.71

 Equal

0.70

 Gleich

0.66

equal

0.66

False

0.64

 igualdad

0.63

 равен

0.61

NotNull

0.59

Activations Density 0.003%