Explanations

references to the concept of "the."

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

Token (quoted)   Logit
"Narr"           -0.07
"永久"           -0.07
"áp"             -0.07
".lex"           -0.07
"stroy"          -0.06
" sprav"         -0.06
" Kirby"         -0.06
" हव"            -0.06
"ơ"              -0.06
"ode"            -0.06

Positive Logits

Token (quoted)    Logit
" meaning"        0.09
" relation"       0.09
" concept"        0.08
" relationship"   0.08
" Relationship"   0.08
" role"           0.08
" nature"         0.07
"relation"        0.07
"meaning"         0.07
" Relation"       0.07

Activations Density 0.037%
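
To put the density figure in context, the short sketch below combines it with the prompt configuration listed above. It assumes that "Activations Density" means the fraction of all dashboard tokens on which the feature fires, which the page does not state explicitly. The function and constant names are illustrative only, and the resulting count is an estimate, not a reported value.

```python
# Figures copied from the dashboard listing.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY = 0.037 / 100  # 0.037% expressed as a fraction


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of tokens across all dashboard prompts."""
    return num_prompts * tokens_per_prompt


def estimated_active_tokens(total: int, density: float) -> int:
    """Rough count of tokens with a nonzero activation.

    Assumption: density is the fraction of tokens on which the
    feature is active. The dashboard does not define the term.
    """
    return round(total * density)


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = estimated_active_tokens(total, ACTIVATION_DENSITY)
print(f"total tokens: {total:,}")
print(f"estimated active tokens: {active:,}")
```

Under that assumption the feature fires on roughly one token in every 2,700, so it is sparse relative to the size of the prompt set.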