INDEX

Explanations

phrases suggesting permission or encouragement

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

aggi

-0.07

urum

-0.07

loff

-0.07

aison

-0.07

unft

-0.07

 Thumb

-0.07

æľĽ

-0.07

addock

-0.06

 Ying

-0.06

POSITIVE LOGITS

 Glob

0.07

.scalablytyped

0.06

achable

0.06

 fran

0.06

tered

0.06

 glob

0.06

 Stream

0.06

Ð½Ð°Ñĩ

0.06

ÑĢÑĸÐ¿

0.06

me

0.06

Activations Density 0.011%