INDEX

Explanations

words related to danger and risks

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

istory

-0.08

my

-0.07

è®©æĪĳ

-0.07

 maybe

-0.07

 saya

-0.06

.nano

-0.06

seau

-0.06

tester

-0.06

à¸ļà¸²à¸¥

-0.06

ãĢĤæĪĳ

-0.06

POSITIVE LOGITS

 âĹ

0.10

{{

0.09

 Nobody

0.07

"{{

0.07

 trope

0.07

([[

0.06

kovÄĽ

0.06

 DependencyProperty

0.06

ibia

0.06

ifact

0.06

Activations Density 0.003%