INDEX

Explanations

phrases related to accountability and control

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

uem

-0.08

 DÃ¢n

-0.07

¦Ĥ

-0.07

snake

-0.07

LayoutManager

-0.07

(=)

-0.07

otu

-0.07

zeich

-0.07

Î»Ïī

-0.07

undi

-0.07

POSITIVE LOGITS

/of

0.06

 fault

0.06

 involved

0.06

Boh

0.05

 Minor

0.05

 usual

0.05

ign

0.05

antro

0.05

.sin

0.05

à¸¸

0.05

Activations Density 0.003%