INDEX

Explanations

comparisons

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 bonté

-0.68

/**

-0.68

+#+#

-0.68

 trône

-0.65

 avoient

-0.65

 trouvera

-0.63

AccessorTable

-0.63

AndEndTag

-0.63

]--;

-0.63

 fourrure

-0.63

POSITIVE LOGITS

0.56

way

0.50

Hentet

0.45

 ways

0.45

 terms

0.44

 style

0.43

 pace

0.43

 place

0.42

 space

0.42

 fashion

0.40

Activations Density 0.003%