INDEX

Explanations

resources for further exploration

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 ذریعے

0.65

<unused1034>

0.64

!:

0.64

<unused345>

0.63

aż

0.63

才是

0.62

<unused662>

0.62

而且

0.61

 razy

0.60

 namesake

0.59

POSITIVE LOGITS

;-

1.40

:-

1.19

;-

1.02

:-

1.02

0.94



0.93

;;

0.93

////

0.91

 निम्न

0.90

ঃ

0.89

Activations Density 0.109%