INDEX

Explanations

directions and additions

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 reconcile

0.70

 disparate

0.69

 knowing

0.67

 neutrons

0.67

 ignorance

0.67

 distract

0.62

 proclaiming

0.62

 discourage

0.60

‟

0.59

 disruptive

0.59

POSITIVE LOGITS

 moze

0.65

puede

0.65

ultimo

0.64

Agregar

0.63

 selatan

0.63

oeste

0.63

ajout

0.63

east

0.62

 आंकड़ा

0.61

 Aynı

0.60

Activations Density 0.036%