Explanations

evidences

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

Token (quoted to show leading spaces)   Logit
' CreateTagHelper'                      -0.98
')"),'                                  -0.93
'ArrowToggle'                           -0.93
'InjectAttribute'                       -0.91
' Majefty'                              -0.90
' myſelf'                               -0.90
' pleaſure'                             -0.89
' defaultstate'                         -0.89
'extAlignment'                          -0.89
'WireFormat'                            -0.86

Positive Logits

Token (quoted to show leading spaces)   Logit
(no visible token)                      0.51
(no visible token)                      0.49
'and'                                   0.48
'IS'                                    0.48
'on'                                    0.48
' when'                                 0.46
'of'                                    0.44
'ha'                                    0.44
'Ha'                                    0.43
'to'                                    0.43

Activations Density 0.638%
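
To put the density figure in context, the following is a rough sketch of the arithmetic, assuming that "Activations Density" means the fraction of dashboard tokens on which the feature has a nonzero activation (the page itself does not define the term). The function name `estimated_active_tokens` is hypothetical and used only for illustration; the prompt count and prompt length are taken from the Configuration section.

```python
# Hypothetical helper, not part of any dashboard API.
# Assumption: density = (tokens with nonzero activation) / (total tokens).
def estimated_active_tokens(num_prompts: int, tokens_per_prompt: int,
                            density_percent: float) -> int:
    """Estimate how many tokens the feature fires on, rounded to the nearest token."""
    total_tokens = num_prompts * tokens_per_prompt
    return round(total_tokens * density_percent / 100)


# Values from the Configuration section: 16,384 prompts of 128 tokens each.
total_tokens = 16_384 * 128
active_tokens = estimated_active_tokens(16_384, 128, 0.638)

print(f"total tokens:  {total_tokens:,}")    # 2,097,152
print(f"active tokens: {active_tokens:,}")   # about 13,380 under the stated assumption
```

Under that assumption, the feature would be active on roughly 13,380 of the 2,097,152 tokens in the dashboard sample. Because the displayed percentage is rounded, the exact count could differ slightly.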