INDEX

Explanations

concepts related to social and ethical standards

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

hips

-0.07

ialis

-0.07

fra

-0.07

inde

-0.06

itis

-0.06

ernen

-0.06

ivity

-0.06

odge

-0.06

buff

-0.06

 odds

-0.06

POSITIVE LOGITS

ative

0.11

atively

0.10

 setters

0.08

/INFO

0.08

folio

0.08

Ã¡lnÃŃ

0.08

 dÄ±ÅŁÄ±

0.08

cy

0.07

Setter

0.07

LayoutConstraint

0.07

Activations Density 0.009%