INDEX

Explanations

phrases related to consent and voluntary actions

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

iben

-0.08

errat

-0.06

rello

-0.06

afort

-0.06

ti

-0.06

Voy

-0.06

rios

-0.06

ombs

-0.06

rette

-0.06

ibble

-0.06

POSITIVE LOGITS

arda

0.07

ames

0.07

ãĥ¼ãĥ©

0.07

aÄŁa

0.07

otate

0.06

ftware

0.06

èº«

0.06

Ð½Ð°ÑĤ

0.06

csi

0.06

 Vulner

0.06

Activations Density 0.076%