INDEX

Explanations

programming identifiers followed by a dot

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

här

-0.97

and

-0.96

 here

-0.93

 after

-0.91

 from

-0.85

 Vaters

-0.84

 här

-0.82

arcas

-0.82

หมด

-0.82

 medži

-0.82

POSITIVE LOGITS

これが

0.99

unto

0.97

扩张

0.96

܇

0.91

 garante

0.91

それでは

0.90

 BORN

0.90

 roland

0.90

 WORTH

0.90

Ã

0.90

Activations Density 0.008%