INDEX

Explanations

phrases related to certainty and assertion in arguments or statements

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

anco

-0.07

taire

-0.07

enco

-0.07

ymi

-0.07

ccione

-0.07

ixon

-0.06

åª

-0.06

Byte

-0.06

emark

-0.06

&type

-0.06

POSITIVE LOGITS

 hereby

0.07

istrovstvÃŃ

0.06

 actually

0.06

ayscale

0.06

utow

0.06

astos

0.06

Slides

0.06

 again

0.06

 here

0.06

 gonna

0.06

Activations Density 0.002%