INDEX

Explanations

admitting limitations or qualifications

apology and explanation

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

私も

0.78

我就

0.72

那我們

0.70

 আমিও

0.68

我们就

0.68

我在

0.65

我们也

0.64

 நானும்

0.63

我都

0.60

 मैंने

0.57

POSITIVE LOGITS

 unapolog

0.59

 disclaimer

0.52

 Partly

0.50

 sorry

0.50

 ashamed

0.48

 heredity

0.46

 granted

0.46

 recentemente

0.46

 apologise

0.46

 partly

0.46

Activations Density 0.174%