INDEX

Explanations

words and phrases related to social interactions and identities

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

ostel

-0.07

abee

-0.07

cion

-0.07

okud

-0.07

osta

-0.06

orre

-0.06

pta

-0.06

nut

-0.06

iston

-0.06

unar

-0.06

Positive Logits

 will

0.12

’ll

0.11

'll

0.11

 sẽ

0.10

will

0.10

ará

0.09

就会

0.09

 będzie

0.08

會

0.08

gnore

0.08

Activations Density 0.041%
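
To put the density figure in context, the short sketch below converts it into an approximate count of token positions. It assumes that "Activations Density" means the fraction of token positions with a nonzero activation, and that it was measured over the prompt set listed under Configuration; neither is stated on this page, so treat the result as a rough estimate only.

```python
# Figures copied from the dashboard above.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PERCENT = 0.041

# Total token positions in the prompt set.
total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT

# Assumption: density is the share of token positions where the feature
# has a nonzero activation, measured over this same prompt set.
estimated_active_tokens = total_tokens * ACTIVATION_DENSITY_PERCENT / 100

print(f"total token positions: {total_tokens:,}")
print(f"estimated active positions: {estimated_active_tokens:,.0f}")
```

Under these assumptions the feature fires on roughly one token position in every 2,400.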