INDEX

Explanations

references to verification, marking, or labeling in various contexts

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

focus

-0.08

adin

-0.07

uby

-0.07

Focus

-0.07

 focus

-0.07

rvine

-0.06

monds

-0.06

æħ

-0.06

 buzz

-0.06

 STRICT

-0.06

POSITIVE LOGITS

stamp

0.13

 stamp

0.12

 mark

0.10

 stamps

0.09

 seal

0.09

 Stamp

0.09

 imprint

0.08

mark

0.08

.stamp

0.08

Stamp

0.08

Activations Density 0.014%