INDEX

Explanations

phrases indicating reasons or justifications

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

alles

-0.07

htt

-0.06

urum

-0.06

ermann

-0.06

olie

-0.06

.raise

-0.06

alte

-0.06

IOC

-0.06

 Realm

-0.06

umbn

-0.06

POSITIVE LOGITS

 justified

0.11

 warranted

0.10

 rightly

0.09

 indeed

0.09

 deserved

0.09

reasonable

0.09

Reason

0.09

reason

0.09

 understandable

0.09

çĲĨçĶ±

0.09

Activations Density 0.088%