INDEX

Explanations

information related to safety and risk, specifically regarding health and medical conditions

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Goodman

-0.07

Æ°

-0.07

andom

-0.07

éĶ¦

-0.07

 GOODMAN

-0.07

oton

-0.07

erg

-0.07

oten

-0.07

ednou

-0.06

Ïİ

-0.06

POSITIVE LOGITS

 whether

0.10

 signs

0.10

 potential

0.10

æĺ¯åĲ¦

0.09

 presence

0.09

 possible

0.09

 Signs

0.08

 æĺ¯åĲ¦

0.08

 Whether

0.07

whether

0.07

Activations Density 0.014%