INDEX

Explanations

proper nouns, especially names related to personal identities and relationships

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

eldon

-0.08

WR

-0.07

ofi

-0.07

curity

-0.07

raÃ§

-0.07

 bout

-0.07

eldo

-0.06

kup

-0.06

mÄĽ

-0.06

eka

-0.06

POSITIVE LOGITS

-turned

0.07

ensis

0.07

ovna

0.06

ANNOT

0.06

wand

0.06

enÃ¡

0.06

Ú¯Ø§Ø±

0.06

LOUD

0.06

ħn

0.06

BA

0.06

Activations Density 0.040%