INDEX

Explanations

references to euphemisms and swearing

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ByExample

-0.07

acon

-0.07

ervals

-0.07

 deduct

-0.06

hazi

-0.06

inden

-0.06

 slash

-0.06

unan

-0.06

ee

-0.06

POSITIVE LOGITS

hur

0.07

 à¤ħà¤¶

0.07

abin

0.07

 taboo

0.07

 ÑĥÐ¿Ð¾ÑĤÑĢÐµÐ±

0.07

 offensive

0.06

 Offensive

0.06

 ejected

0.06

æ±¡

0.06

ç

0.06

Activations Density 0.022%