Explanations

punctuation

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

Token             Logit
"âļĹ"             -0.32
"grily"           -0.28
"å®¶éĥ½çŁ¥éģĵ"    -0.27
" Prostit"        -0.25
"è¯´å®ŀ"          -0.24
"çĽĳåĲ¬é¡µéĿ¢"    -0.24
" ÙĪÙĩÙĨØ§"       -0.24
"æīĢæīĢ"          -0.24
"åĪ©çĶ¨æĤ¨çļĦ"    -0.24
"¡´"              -0.23

Positive Logits

Token             Logit
"èĢĮè¿Ļ"          0.40
"so"              0.40
"èĢĮ"             0.38
" which"          0.37
" while"          0.37
"and"             0.35
"but"             0.35
" thus"           0.33
"èĢĮä¸Ķè¿ĺ"       0.33
" thereby"        0.32
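
Several tokens in the logit lists look like raw byte-level BPE vocabulary strings rather than readable text. The sketch below assumes the tokenizer uses the GPT-2 style byte-to-unicode table, which this page does not confirm, and maps such a string back to UTF-8. The names `byte_decoder` and `decode_token` are hypothetical helpers, not part of the dashboard.

```python
def byte_decoder() -> dict[str, int]:
    """Rebuild the GPT-2 style byte-to-unicode table and invert it."""
    # Bytes that the table maps to themselves.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    table = {b: chr(b) for b in printable}
    # Every remaining byte is shifted to a code point starting at 256.
    shift = 0
    for b in range(256):
        if b not in table:
            table[b] = chr(256 + shift)
            shift += 1
    return {ch: b for b, ch in table.items()}


DECODER = byte_decoder()


def decode_token(token: str) -> str:
    """Decode one vocabulary string; undecodable bytes become U+FFFD."""
    raw = bytearray()
    for ch in token:
        # Assumption: the page already rendered the space marker as a
        # literal space, so a space is treated as byte 0x20.
        raw.append(0x20 if ch == " " else DECODER[ch])
    return raw.decode("utf-8", errors="replace")


print(decode_token("èĢĮ"))        # 而
print(decode_token("èĢĮä¸Ķè¿ĺ"))  # 而且还
print(decode_token(" which"))     # unchanged: " which"
```

Under that assumption, the non-Latin positive-logit tokens decode to Chinese conjunctions, in line with the English entries such as "and", "but", and " thus". A token that is only a fragment of a multi-byte character, such as "¡´", has no valid decoding on its own.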

Activations Density 2.001%
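
To put the density figure in context, the sketch below treats activation density as the fraction of tokens on which the feature fires (activation greater than zero). It further assumes, without confirmation from this page, that the figure was measured over the dashboard prompts listed under Configuration. Both function names are hypothetical.

```python
def activation_density(activations: list[float]) -> float:
    """Fraction of token positions with a nonzero (positive) activation."""
    return sum(1 for a in activations if a > 0) / len(activations)


def approx_active_tokens(n_prompts: int, tokens_per_prompt: int, density: float) -> int:
    """Approximate count of active tokens implied by a density figure."""
    return round(n_prompts * tokens_per_prompt * density)


# Values from the Configuration section: 16,384 prompts, 128 tokens each.
total_tokens = 16_384 * 128
approx_active = approx_active_tokens(16_384, 128, 0.02001)
print(total_tokens, approx_active)
```

Under those assumptions, a density of 2.001% corresponds to roughly 42 thousand active token positions out of about 2.1 million.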