INDEX

Explanations

phrases indicating known truths or widely accepted facts

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

oder

-0.07

ieri

-0.07

 Perr

-0.07

addle

-0.06

æŁ´

-0.06

 ones

-0.06

aley

-0.06

erson

-0.06

 bubble

-0.06

-door

-0.06

POSITIVE LOGITS

ANDLE

0.07

áº¥n

0.07

ignon

0.07

èº«

0.07

atta

0.06

CLU

0.06

ergus

0.06

illez

0.06

andler

0.06

edido

0.06

Activations Density 0.032%