INDEX

Explanations

phrases questioning knowledge or awareness

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 even

-0.07

 nowhere

-0.07

 anyone

-0.07

any

-0.06

ÑĥÐ¶Ð´

-0.06

if

-0.06

yster

-0.06

statusCode

-0.06

 anywhere

-0.06

guy

-0.05

POSITIVE LOGITS

essler

0.07

icode

0.07

CHED

0.07

bakan

0.07

bek

0.07

plu

0.07

_RB

0.07

iar

0.07

bows

0.07

arious

0.06

Activations Density 0.007%