Explanations

phrases related to accusations and claims of wrongdoing

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

ENO

-0.08

¶Į

-0.07

ijd

-0.07

uxtap

-0.07

ounter

-0.07

reff

-0.07

isOk

-0.07

дам

-0.07

áj

-0.07

raž

-0.07

Positive Logits

 being

0.11

 having

0.11

 Being

0.08

being

0.07

Being

0.07

iron

0.07

 Having

0.07

 být

0.07

having

0.06

 être

0.06

Activations Density 0.018%
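
To give a sense of scale for the density figure, the short sketch below combines it with the prompt configuration listed above. It assumes, and this is not stated on the page, that the density is the fraction of token positions with a nonzero activation, measured over the same 24,576 prompts of 128 tokens each. The function names are illustrative only.

```python
# Figures taken from the Configuration and Activations Density entries.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PERCENT = 0.018


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions across all prompts."""
    return num_prompts * tokens_per_prompt


def implied_active_tokens(total: int, density_percent: float) -> float:
    """Token positions on which the feature fires.

    Assumption: density = active positions / total positions,
    computed over exactly these prompts.
    """
    return total * density_percent / 100.0


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = implied_active_tokens(total, ACTIVATION_DENSITY_PERCENT)

print(f"total token positions: {total:,}")
print(f"implied active positions: about {round(active):,}")
```

Under that assumption the feature fires on only a few hundred of roughly three million token positions, which is consistent with a sparse feature.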