INDEX

Explanations

phrases related to safety and security protocols

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ses

-0.07

side

-0.07

ylon

-0.07

Ø®Ø§ÙĨÙĩ

-0.07

ëŀĳ

-0.06

ÎµÎ¯ÏĦÎµ

-0.06

Ð¶Ð´

-0.06

ph

-0.06

kur

-0.06

uy

-0.06

POSITIVE LOGITS

 ráº±ng

0.11

0.08

 that

0.08

 bahwa

0.08

eru

0.08

that

0.07

oucher

0.07

ÑģÑĮ

0.07

ments

0.07

/prom

0.07

Activations Density 0.016%