INDEX

Explanations

references to Justin Bieber and associations with controversies or negative incidents

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 cancers

-0.07

 remar

-0.07

çµ

-0.07

SELL

-0.07

_fb

-0.07

abra

-0.06

%p

-0.06

 mast

-0.06

FB

-0.06

 æ¬

-0.06

POSITIVE LOGITS

DM

0.08

ior

0.07

iales

0.07

DM

0.07

etto

0.07

apon

0.06

Ã¨

0.06

endor

0.06

MT

0.06

 Backup

0.06

Activations Density 0.004%