INDEX

Explanations

phrases or concepts denoting contradictions or complexities in social situations

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

ND

-0.07

Bes

-0.07

iaux

-0.06

Bes

-0.06

ordion

-0.06

atar

-0.06

кра

-0.06

sna

-0.06

ampa

-0.06

asp

-0.06

Positive Logits

ior

0.08

 wonderful

0.07

 notion

0.07

저

0.07

ultipart

0.07

 thing

0.07

 little

0.06

 great

0.06

iras

0.06

 incredible

0.06

Activations Density 0.024%