INDEX

Explanations

references to shocking or disturbing content

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

abol

-0.07

ature

-0.07

infos

-0.07

 Bakan

-0.07

umi

-0.07

æĲŃ

-0.07

Ð»Ð¸ÑĪ

-0.06

Ð½Ð¸Ð½

-0.06

æĿŁ

-0.06

grant

-0.06

POSITIVE LOGITS

ãģı

0.07

 modal

0.06

anz

0.06

/Runtime

0.06

0.05

 domic

0.05

aal

0.05

bas

0.05

McL

0.05

Activations Density 0.001%