INDEX

Explanations

concepts related to diversity and inclusion

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

icolon

-0.07

upy

-0.07

ÑĥÐ»Ñı

-0.07

rix

-0.07

ĥĿ

-0.07

aryl

-0.07

-alist

-0.07

uw

-0.07

áº«n

-0.07

ipation

-0.07

POSITIVE LOGITS

 diversity

0.11

 Diversity

0.09

 divers

0.09

 diverse

0.08

 contributions

0.08

 Ã§eÅŁit

0.07

 differences

0.07

div

0.07

 everyone

0.07

 Contributions

0.07

Activations Density 0.018%