INDEX

Explanations

phrases indicating audience engagement or involvement

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ikal

-0.07

egl

-0.07

filer

-0.07

kes

-0.06

tb

-0.06

_nf

-0.06

fal

-0.06

yms

-0.06

pell

-0.06

aÃ§

-0.06

POSITIVE LOGITS

eland

0.07

irth

0.06

ople

0.06

bakan

0.06

lamaz

0.06

 **)&

0.06

ONUS

0.06

Ø§Ø±Ø§ÙĨ

0.06

linger

0.06

pton

0.06

Activations Density 0.002%