INDEX

Explanations

column structure and content

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

Logit   Token
-0.82   "邦"
-0.78   "FINAL"
-0.78   " السعود"
-0.76   "禺"
-0.76   "reszcie"
-0.75   " Consolidation"
-0.75   "íně"
-0.74   "又"
-0.74   "兵"
-0.73   "血"

Positive Logits

Logit   Token
0.96    "CardView"
0.90    "队友"
0.84    "kete"
0.80    "~="
0.79    "hover"
0.77    "col"
0.77    " Trotz"
0.76    "的服务"
0.74    " remarqu"
0.73    "してほしい"

Activations Density 0.008%
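
To give the density figure some scale, the sketch below estimates how many token positions that percentage would correspond to. It assumes, without confirmation from the page, that the density was measured over the dashboard prompt set listed under Configuration (24,576 prompts of 128 tokens each). The function name is hypothetical, and since the displayed density is rounded, the result is only approximate.

```python
# Back-of-the-envelope estimate. ASSUMPTION: the activation density was
# computed over the dashboard prompts listed under Configuration.
NUM_PROMPTS = 24_576       # from "Prompts (Dashboard)"
TOKENS_PER_PROMPT = 128    # from "Prompts (Dashboard)"
DENSITY_PERCENT = 0.008    # from "Activations Density" (rounded on the page)


def estimate_active_tokens(num_prompts: int, tokens_per_prompt: int,
                           density_percent: float) -> tuple[int, float]:
    """Return (total token positions, estimated positions where the feature fires)."""
    total_tokens = num_prompts * tokens_per_prompt
    active_tokens = total_tokens * density_percent / 100.0
    return total_tokens, active_tokens


total, active = estimate_active_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT, DENSITY_PERCENT)
print(f"{total:,} token positions, roughly {active:.0f} with a nonzero activation")
```

Under that assumption the feature would fire on roughly 250 of about 3.1 million token positions, which is consistent with a very sparse feature.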