INDEX

Explanations

negative terms or phrases indicating the absence or failure of something

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Î¯Î¿ÏĤ

-0.07

zeit

-0.06

rod

-0.06

ncia

-0.06

UNCTION

-0.06

holding

-0.06

Dro

-0.06

halt

-0.06

iously

-0.06

.DropTable

-0.06

POSITIVE LOGITS

rica

0.07

ark

0.07

ABCDE

0.06

cu

0.06

tres

0.06

cum

0.06

orage

0.06

rier

0.06

acco

0.06

_si

0.06

Activations Density 0.025%