INDEX

Explanations

phrases that indicate a sense of obligation or purpose

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

itas

-0.06

eld

-0.06

raph

-0.06

áº£y

-0.06

clusions

-0.06

edin

-0.06

ima

-0.06

rompt

-0.06

arsity

-0.06

lessly

-0.06

POSITIVE LOGITS

unately

0.10

 instance

0.10

bid

0.09

 example

0.08

give

0.07

zier

0.07

given

0.07

hiba

0.07

instance

0.06

geo

0.06

Activations Density 0.032%