Explanations

references to viruses and biological threats

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): cerebras/SlimPajama-627B

Negative Logits

Token               Logit
".scalablytyped"    -0.08
" Marble"           -0.07
"èĩº"               -0.07
"áº£nh"             -0.07
"ÏĥÏĥ"              -0.07
"ÏĥÎ®"              -0.07
"ships"             -0.07
"ÑĢÐµÐ±"            -0.07
"ignum"             -0.06
"utters"            -0.06
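Several tokens in the list above (for example `ÏĥÏĥ` and `áº£nh`) look like the way byte-level BPE vocabularies display non-ASCII text: each UTF-8 byte is shown as a stand-in printable character. The sketch below assumes the model uses the GPT-2 style byte-to-character table, which the page does not confirm; the helper names `byte_decoder` and `decode_token` are hypothetical and exist only for this illustration.

```python
def byte_decoder() -> dict[str, int]:
    """Build the inverse of the GPT-2 style byte-to-character table.

    Assumption: the tokenizer behind this dashboard uses that table.
    """
    # Bytes that are displayed as themselves.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    mapping = {}
    shift = 0
    for b in range(256):
        if b in printable:
            mapping[chr(b)] = b
        else:
            # Remaining bytes are displayed as code points 256, 257, ...
            mapping[chr(256 + shift)] = b
            shift += 1
    # Assumption: the dashboard renders the space marker as a literal space.
    mapping[" "] = 0x20
    return mapping


DECODER = byte_decoder()


def decode_token(token: str) -> str:
    """Map each displayed character back to its byte, then read as UTF-8."""
    raw = bytes(DECODER[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


for tok in ["èĩº", "áº£nh", "ÏĥÏĥ", "ÏĥÎ®", "ÑĢÐµÐ±"]:
    print(repr(tok), "->", decode_token(tok))
```

Under that assumption, the five strings decode to 臺, ảnh, σσ, σή and реб, so they would be ordinary Chinese, Vietnamese, Greek and Cyrillic fragments, not corrupted text.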

Positive Logits

Token        Logit
" crown"     0.07
"uzzi"       0.06
" Nature"    0.06
"Nat"        0.06
"ifr"        0.06
"frau"       0.06
"wald"       0.06
"adolu"      0.06
" origin"    0.05
"Gos"        0.05

Activations Density 0.005%
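To give the density figure a sense of scale, the short calculation below combines it with the prompt configuration listed on this page. It assumes that the density is the fraction of token positions with a non-zero activation and that it was measured over the same 24,576 prompts; neither point is stated on the page, and the displayed percentage is rounded, so the result is only a rough estimate.

```python
# Values copied from the dashboard configuration.
PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.005  # "Activations Density 0.005%"

# Total token positions covered by the dashboard prompts.
total_tokens = PROMPTS * TOKENS_PER_PROMPT

# Assumption: density = share of token positions where the feature is non-zero.
expected_active = total_tokens * DENSITY_PERCENT / 100

print(f"{total_tokens:,} tokens in total")
print(f"about {expected_active:.0f} tokens with non-zero activation")
```

On these assumptions the feature would fire on roughly 157 of about 3.1 million token positions.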