INDEX

Explanations

phrases indicating prerequisites or conditions that must be met before taking action

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

å¾Ħ

-0.07

ãĤħ

-0.06

och

-0.06

_flush

-0.06

 Hava

-0.06

onica

-0.06

ingen

-0.06

è¿ĺæĺ¯

-0.06

ãģªãģĮãĤī

-0.06

Composition

-0.06

POSITIVE LOGITS

can

0.08

 progress

0.08

any

0.08

æīįèĥ½

0.07

 anything

0.07

 proceed

0.07

åı¯ä»¥

0.07

 Progress

0.06

 else

0.06

 proceeded

0.06

Activations Density 0.014%