INDEX

Explanations

comparative phrases indicating superiority or preference

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

gaard

-0.08

rah

-0.08

isp

-0.07

adlo

-0.07

SSIP

-0.07

cult

-0.07

Ã¨les

-0.07

prit

-0.07

elong

-0.07

sWith

-0.07

POSITIVE LOGITS

'gc

0.07

ãģĬãĤĬ

0.07

aja

0.06

ipur

0.06

 ifndef

0.06

 those

0.06

ifold

0.06

ê»ĺ

0.06

åļ

0.06

olic

0.06

Activations Density 0.017%