INDEX

Explanations

review following information

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 reacting

-0.81

 Mexicana

-0.80

 κάν

-0.79

 Minangkabau

-0.79

زو

-0.78

&.

-0.78

 mexicana

-0.78

 maux

-0.78

龙头

-0.77

cijas

-0.77

POSITIVE LOGITS

htdocs

0.82

 Hate

0.80

FB

0.77

 dioses

0.75

zine

0.73

 cuarta

0.72

anglement

0.71

瀑

0.71

por

0.71

kubernetes

0.71

Activations Density 0.046%