Explanations

phrases that reflect personal identity and individual attributes

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

utin

-0.07

zos

-0.07

uzey

-0.07

kla

-0.07

rides

-0.07

ritel

-0.07

upertino

-0.07

 Winds

-0.06

pais

-0.06

atrice

-0.06

Positive Logits

 individuals

0.16

 individual

0.12

 Individuals

0.11

individual

0.11

 humans

0.11

 human

0.10

Individual

0.09

 people

0.09

 person

0.09

 людини

0.09

Activations Density 0.049%
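
To put the density figure in context, the following minimal sketch converts it into an approximate token count. It assumes the density is measured over the same dashboard prompt set listed under Configuration (24,576 prompts of 128 tokens each), which the page itself does not state; the variable names are illustrative only.

```python
# Values copied from the dashboard configuration above.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PCT = 0.049  # percent of tokens on which the feature fires

# Total number of tokens in the dashboard prompt set.
total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT

# Assumption: the density is the fraction of these tokens with a nonzero
# activation. The page does not confirm which token set it was computed on.
expected_active = total_tokens * ACTIVATION_DENSITY_PCT / 100

print(f"total tokens: {total_tokens:,}")
print(f"approx. activating tokens: {expected_active:,.0f}")
```

Under that assumption, the feature would fire on roughly 1,500 of about 3.1 million tokens.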