INDEX

Explanations

references to research centers or think tanks

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

à¤¹à¤¨

-0.10

gii

-0.08

eras

-0.08

'&&

-0.08

tiv

-0.07

orney

-0.07

adiator

-0.07

ennen

-0.07

OOK

-0.07

letic

-0.07

POSITIVE LOGITS

 American

0.06

eco

0.06

 soft

0.06

icom

0.06

0.05

 sliding

0.05

 Slide

0.05

ech

0.05

res

0.05

 yáº¿u

0.05

Activations Density 0.006%