INDEX

Explanations

terms that describe discomfort or irritation

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

lify

-0.08

erif

-0.08

odzi

-0.08

Luz

-0.07

uitka

-0.07

inka

-0.07

lake

-0.07

stral

-0.07

çĬ¯ç½ª

-0.07

_FLUSH

-0.06

POSITIVE LOGITS

ingly

0.10

ancel

0.08

antal

0.07

ulence

0.07

ants

0.06

.opensource

0.06

ative

0.06

xe

0.06

ours

0.06

OptionsMenu

0.06

Activations Density 0.005%