INDEX

Explanations

programmatic elements, lists, and non-English words

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

-2.48

 some

-2.33

of

-2.23

 where

-2.03

-1.96

our

-1.85

 these

-1.81

on

-1.73

 Some

-1.71

These

-1.66

POSITIVE LOGITS

긔

2.36

 první

2.09

atized

1.98

 ofta

1.95

atization

1.95

</tr>

1.92

 DOMINGO

1.92

illir

1.86

lograph

1.86

;</

1.84

Activations Density 0.056%