INDEX

Explanations

instances of manipulation or distortion of information

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

imit

-0.08

ãĥ³ãĤ¹

-0.07

ikk

-0.06

eker

-0.06

 Matth

-0.06

umer

-0.06

.BO

-0.06

dma

-0.06

 ØŃÚ©Ùħ

-0.06

OPTIONS

-0.05

POSITIVE LOGITS

Ã¶rt

0.07

gebn

0.07

/dist

0.07

benh

0.06

itis

0.06

AIT

0.06

 inorder

0.06

.Stretch

0.06

 Geschichte

0.06

 inconvenient

0.06

Activations Density 0.027%