INDEX

Explanations

statistics and alarming findings regarding safety and health concerns

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ijd

-0.09

ichick

-0.07

SystemService

-0.07

 incididunt

-0.07

 incel

-0.07

Ã¤ll

-0.07

ÃŃg

-0.07

.='

-0.06

Ã¶l

-0.06

inus

-0.06

POSITIVE LOGITS

 knowledge

0.08

 awareness

0.07

 quiz

0.07

 ignorance

0.07

 correct

0.06

çŁ¥è¯Ĩ

0.06

 Knowledge

0.06

 fame

0.06

 answers

0.06

knowledge

0.06

Activations Density 0.006%