INDEX

Explanations

references to "secrets" and their implications in various contexts

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

SSIP

-0.07

Ð»Ð°Ð±

-0.07

alon

-0.07

ITA

-0.07

aan

-0.07

itas

-0.06

abus

-0.06

clr

-0.06

.struts

-0.06

æ¦

-0.06

POSITIVE LOGITS

 secrets

0.13

 secret

0.12

 Secrets

0.12

 Secret

0.10

Secret

0.09

-secret

0.09

(secret

0.08

secret

0.08

 Successful

0.08

ç§ĺ

0.08

Activations Density 0.011%