INDEX

Explanations

references to coercive actions or allegations of misconduct

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

antity

-0.08

affen

-0.08

ardin

-0.07

 {\↵

-0.07

Ã´n

-0.07

rocess

-0.07

 simplex

-0.07

urry

-0.07

ngth

-0.07

ennent

-0.07

POSITIVE LOGITS

appropri

0.06

 Genius

0.06

inski

0.06

 unwanted

0.06

 creep

0.06

 inappropriate

0.06

mented

0.06

 ilma

0.06

aged

0.06

way

0.05

Activations Density 0.004%