INDEX

Explanations

left, 1, or A followed by punctuation

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

mentioned

-1.02

 toppen

-0.94

 likely

-0.90

 모든

-0.90

cluso

-0.90

 every

-0.87

(*(

-0.86

 hvert

-0.85

 hunde

-0.85

 sämtliche

-0.85

POSITIVE LOGITS

上方

0.91

if

0.91

 seen

0.90

racene

0.90

אם

0.90

 sát

0.90

suz

0.88

 recevoir

0.88

 martie

0.86

近い

0.86

Activations Density 0.012%