INDEX

Explanations

concepts related to moral standards and choices

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

-0.07

á»ķi

-0.06

ugg

-0.06

 Seam

-0.06

hiba

-0.06

XX

-0.06

lish

-0.06

Smy

-0.06

POSITIVE LOGITS

alternate

0.08

isd

0.07

IEnumerator

0.07

mime

0.07

memset

0.06

 uncomment

0.06

 Ø¹Ø²ÛĮØ²

0.06

çīĩ

0.06

vfs

0.06

ĶåĽŀ

0.06

Activations Density 0.003%