Explanations

preference/favor

Configuration

Prompts (Dashboard): 16,384 prompts, 128 tokens each
Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Token               Logit
"SerializedSize"    -0.60
"/\."               -0.56
" Congrats"         -0.54
" brava"            -0.52
"ponential"         -0.52
"MinWidth"          -0.51
"rews"              -0.51
"grès"              -0.50
"]));"              -0.50
"NameInMap"         -0.50

Positive Logits

Token               Logit
" parlent"          0.56
" receber"          0.50
" tett"             0.48
"uncle"             0.48
" touristes"        0.47
"tine"              0.47
"AndEndTag"         0.46
" AssemblyTitle"    0.46
" atender"          0.46
"Treatment"         0.45

Activations Density: 0.002%