INDEX

Explanations

positive adjectives

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 mêmes

-0.50

 autorytatywna

-0.47

 contenus

-0.45

 conseguenza

-0.43

новништво

-0.42

Ante

-0.42

 aikaa

-0.41

 aussieht

-0.41

 вещей

-0.41

Ad

-0.40

POSITIVE LOGITS

are

0.76

>{@

0.74

 ويكيميديا

0.73

 externi

0.69

 they

0.69

DrawerToggle

0.68

 were

0.65

 allAfrica

0.63

UnusedPrivate

0.60

 HttpHeaders

0.59

Activations Density 0.001%