INDEX

Explanations

code and descriptions

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

KommentareTeilen

-0.91

 mystery

-0.81

 Majefty

-0.78

rungsseite

-0.77

 createState

-0.75

DeleteBehavior

-0.74

 استنادى

-0.73

 الحره

-0.72

 صوتيه

-0.71

 uſed

-0.71

POSITIVE LOGITS

ins

0.56

my

0.42

leg

0.40

Ins

0.38

Ins

0.37

 meus

0.36

 encu

0.35

En

0.35

omb

0.35

 Vapor

0.34

Activations Density 0.000%