INDEX

Explanations

in

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

oriously

-0.25

åľ°ä¸Ĭ

-0.24

comings

-0.24

ughs

-0.24

çİ°åľº

-0.23

ç½ĳä¸Ĭ

-0.23

oretical

-0.22

à¸Ĳà¸²à¸Ļ

-0.21

-bordered

-0.21

-ves

-0.21

POSITIVE LOGITS

 order

0.32

 terms

0.30

ventario

0.29

 accordance

0.28

serter

0.27

herits

0.27

:black

0.27

 case

0.26

the

0.26

 spite

0.25

Activations Density 2.884%