INDEX

Explanations

Technical/scientific topics

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 greateſt

-0.93

 auffi

-0.78

 myſelf

-0.77

 itſelf

-0.75

 Monfieur

-0.74

 ainfi

-0.73

 beſt

-0.70

 intentionally

-0.69

 whoſe

-0.69

 themſelves

-0.69

POSITIVE LOGITS

providedIn

0.64

0.59

et

0.54

InjectAttribute

0.50

es

0.50

 Wall

0.50

 Person

0.50

0.49

 dane

0.48

niająca

0.48

Activations Density 0.635%