INDEX

Explanations

possessive forms and phrases indicating ownership or experiences

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

NESS

-0.07

 ÑĩÐµÐ»Ð¾Ð²ÐµÐº

-0.07

 itself

-0.07

enÄĽ

-0.07

ÐµÐ»Ð¾Ð²

-0.07

agrid

-0.07

ulumi

-0.07

 sÃ¡m

-0.07

InstanceOf

-0.07

ivial

-0.06

POSITIVE LOGITS

 minds

0.09

 themselves

0.09

 lives

0.08

 efforts

0.08

 hearts

0.07

ongyang

0.07

 rights

0.07

 choices

0.07

 favorite

0.07

 ability

0.07

Activations Density 0.022%