Explanations

expressions of self-love, respect, and empowerment

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

Token            Logit
"ze"             -0.06
"ylon"           -0.06
".extensions"    -0.06
"ze"             -0.06
" amateurs"      -0.06
" privileged"    -0.06
" priv"          -0.06
"onet"           -0.06
"izr"            -0.06
"versed"         -0.06

Positive Logits

Token            Logit
" confidence"    0.11
" Confidence"    0.10
"confidence"     0.09
" dignity"       0.09
"-esteem"        0.09
" pride"         0.09
"Self"           0.09
" self"          0.09
" SELF"          0.08
" Self"          0.08

Activations Density 0.038%
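
To put the density figure in context, here is a rough sketch of the arithmetic in Python. It assumes that "Activations Density" means the share of dashboard tokens on which the feature has a nonzero activation; the page does not define the term, so treat that as an assumption. The constant and function names are illustrative only.

```python
# Figures taken from the dashboard above.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PERCENT = 0.038


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of tokens across all dashboard prompts."""
    return num_prompts * tokens_per_prompt


def approx_active_tokens(total: int, density_percent: float) -> int:
    """Estimated count of tokens with a nonzero activation.

    Assumption: density is the fraction of tokens (as a percentage)
    on which the feature fires. The dashboard does not state this.
    """
    return round(total * density_percent / 100)


if __name__ == "__main__":
    total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
    active = approx_active_tokens(total, ACTIVATION_DENSITY_PERCENT)
    print(f"total tokens: {total:,}")
    print(f"approx. active tokens: {active:,}")
```

Under that reading, the feature fires on roughly 1,195 of the 3,145,728 tokens in the dashboard prompts, which is in keeping with a sparse feature.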