INDEX

Explanations

negative phrases related to action or inaction

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

pd

-0.07

 Ashe

-0.07

ãģĮãģĬ

-0.06

leon

-0.06

rollo

-0.06

adlo

-0.06

brig

-0.06

aÃ§

-0.06

ICY

-0.06

idge

-0.06

POSITIVE LOGITS

upa

0.07

oire

0.06

upert

0.06

fusion

0.06

ØªÙĪ

0.06

anton

0.06

isoft

0.06

ToBounds

0.06

ancel

0.06

Activations Density 0.038%