INDEX

Explanations

mathematical symbols or punctuation

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

もしか

-0.88

ABIL

-0.77

カスタム

-0.75

$^{\

-0.74

arza

-0.73

𝑲

-0.73

 poate

-0.73

Alter

-0.72

нення

-0.71

アドレス

-0.71

POSITIVE LOGITS

孰

0.91

mengg

0.85

Hiya

0.84

 encontraron

0.83

 ziyaret

0.83

 soportar

0.82

 Researchers

0.82

0.81

Ottimo

0.81

 Hiram

0.81

Activations Density 0.020%