INDEX

Explanations

two-letter abbreviations

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 потрібно

-0.90

ें

-0.89

bB

-0.84

写真は

-0.81

nF

-0.77

ே

-0.77

 warnings

-0.77

sS

-0.76

 fortfarande

-0.76

 Bedarf

-0.76

POSITIVE LOGITS

Inputs

0.93

 Indeed

0.92

Univers

0.85

Į

0.84

 intensiv

0.84

 matematik

0.84

decided

0.84

<.

0.83

indeed

0.82



0.82

Activations Density 0.040%