INDEX

Explanations

two-letter abbreviations

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

con

-0.88

agaimana

-0.80

手間

-0.78

呼ぶ

-0.77

 each

-0.77

后

-0.76

 finally

-0.75

con

-0.75

 initially

-0.75

locke

-0.74

POSITIVE LOGITS

ཫ

1.03

woners

0.97

ソニー

0.94

Stateless

0.93

centric

0.93

graphers

0.92

ʕ

0.92

 drôle

0.91

här

0.90

getE

0.90

Activations Density 0.037%