INDEX

Explanations

words followed by specific phrases

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 uzyska

-0.75

 eingerichtet

-0.74

ｗｗｗ

-0.74

kových

-0.73

 exorbitant

-0.71

Rota

-0.71

Somer

-0.71

骛

-0.70

iVar

-0.70

リスク

-0.70

POSITIVE LOGITS

 ピンク

0.79

 Figs

0.76

 Coaches

0.74

なのが

0.72

 ресто

0.72

Dip

0.71

 Lucie

0.71

{#

0.71

cru

0.70

 watchdog

0.70

Activations Density 0.009%