INDEX

Explanations

After

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

elmet

-0.22

utschen

-0.20

 prosec

-0.20

ypad

-0.19

åĺŀ

-0.19

alach

-0.19

 cellFor

-0.19

 nomine

-0.19

unakan

-0.18

tractive

-0.18

POSITIVE LOGITS

noon

0.32

 initial

0.30

å®ĮæĪĲåĲİ

0.27

 completion

0.25

effects

0.25

thought

0.24

Ø³Ø©

0.24

"After

0.24

 Initial

0.24

ä¸Ģçķª

0.23

Activations Density 0.998%