INDEX

Explanations

content related to health risks and safety concerns for vulnerable populations

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

イク

-0.07

anse

-0.07

 сор

-0.06

沖

-0.06

 Relief

-0.06

 intptr

-0.06

 cakes

-0.06

amation

-0.06

bsd

-0.06

υν

-0.06

Positive Logits

 coron

0.08

 safety

0.08

Saf

0.07

 Fatal

0.07

afe

0.07

 deadly

0.07

 Sleep

0.07

 fatal

0.07

 breathing

0.07

 breath

0.07

Activations Density

0.001%