INDEX

Explanations

in

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 problems

-0.50

 unbearable

-0.48

 difficulties

-0.47

UVWXYZ

-0.47

θρω

-0.44

ticides

-0.44

 hardships

-0.43

 aggravated

-0.43

ื่อง

-0.43

 nitrates

-0.43

POSITIVE LOGITS

 intptr

0.71

Билгалдахарш

0.63

 nemlig

0.63

FunctionFlags

0.63

Controllo

0.63

zugehen

0.63

 redan

0.59

postsleuth

0.59

Viitteet

0.59

thâu

0.59

Activations Density 0.001%