INDEX

Explanations

allegedly, supposedly, arguments

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

fichier

0.73

 ótimo

0.73

ann

0.72

 hepat

0.71

<iframe>

0.71

頰

0.70

 pasti

0.70

Ẽ

0.70

excellent

0.69

 lovely

0.69

POSITIVE LOGITS

 according

1.98

 According

1.85

 allegedly

1.79

According

1.62

 supposedly

1.59

according

1.56

 якобы

1.54

 menurut

1.52

 apparently

1.51

 argues

1.50

Activations Density 0.069%