INDEX

Explanations

elements related to risk and safety in medical context

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

PlotsExplanationShow Test FieldDefault Test Text

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

agt

-0.07

ediator

-0.07

áº¹

-0.07

coop

-0.07

ORB

-0.06

nota

-0.06

 hometown

-0.06

mitt

-0.06

phet

-0.06

ê

-0.06

POSITIVE LOGITS

 science

0.09

 Regulation

0.09

 Regulatory

0.09

 regulators

0.08

 regulatory

0.08

 Science

0.08

 regulation

0.08

 scientific

0.08

 scient

0.08

 "...

0.07

Activations Density 0.008%