Explanations

legal appeals/academic papers

Configuration

Prompts (Dashboard): 16,384 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Token          Logit
" appeal"      -1.27
" Appeal"      -1.12
" APPEAL"      -1.07
"appeal"       -0.98
"Appeal"       -0.93
" attention"   -0.83
" appealed"    -0.68
" APPEALS"     -0.67
" Attention"   -0.65
"+#+#"         -0.65

Positive Logits

Token          Logit
"of"           0.54
" braccia"     0.52
"in"           0.52
" ervan"       0.49
" braccio"     0.49
" őket"        0.49
"šet"          0.47
" with"        0.46
" makl"        0.46
" costumi"     0.46

Activations Density: 0.040%
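
To give a sense of scale for the density figure, the following is a minimal sketch of the arithmetic. It assumes (the page does not state this) that the 0.040% density is measured over the same dashboard prompts listed under Configuration, i.e. 16,384 prompts of 128 tokens each. The function names are hypothetical and used only for illustration.

```python
# Figures copied from the dashboard; how they relate is an assumption.
NUM_PROMPTS = 16_384
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY = 0.040 / 100  # 0.040% expressed as a fraction


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions across all dashboard prompts."""
    return num_prompts * tokens_per_prompt


def expected_active_tokens(density: float, n_tokens: int) -> float:
    """Approximate count of token positions where the feature is active,
    assuming the density was computed over exactly these n_tokens."""
    return density * n_tokens


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = expected_active_tokens(ACTIVATION_DENSITY, total)
print(f"total tokens: {total}")
print(f"approx. active tokens: {active:.0f}")
```

Under that assumption, the feature would fire on roughly 839 of about 2.1 million token positions, which is consistent with a sparse, topic-specific feature.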