INDEX

Explanations

numbers and punctuation

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

to

-1.58

 declaró

-1.50

коля

-1.48

-1.46

 confirmó

-1.45

кож

-1.42

 reveló

-1.41

but

-1.41

之意

-1.41

 menyadari

-1.40

POSITIVE LOGITS

 borracha

1.83

我们

1.66

 ralla

1.62

 their

1.57

two

1.53

her

1.50

one

1.49

him

1.48

 three

1.48

 seven

1.47

Activations Density 0.008%