INDEX

Explanations

phrases expressing intentions or desires related to actions

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

orre

-0.08

xin

-0.07

arium

-0.07

á»Ĳ

-0.07

ISCO

-0.07

aln

-0.07

landa

-0.07

usercontent

-0.07

ercul

-0.06

Äĥr

-0.06

POSITIVE LOGITS

vais

0.08

0.07

grad

0.07

enaire

0.06

 odds

0.06

 consider

0.06

ors

0.06

.UnitTesting

0.06

 look

0.06

 consideration

0.06

Activations Density 0.007%