INDEX

Explanations

research papers

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

:params

-0.28

)__

-0.23

éķŀ

-0.23

/Dk

-0.22

UnderTest

-0.22

æĭľå¸ĪåŃ¦

-0.22

*Math

-0.21

ropri

-0.21

:"-"`↵

-0.21

:normal

-0.21

POSITIVE LOGITS

 studies

0.35

 published

0.34

 literature

0.29

 scattered

0.27

published

0.26

 reference

0.26

 litter

0.25

 publications

0.25

 references

0.25

åĽ½åĨħå¤ĸ

0.25

Activations Density 2.944%