Explanations

references to gambling issues and associated support resources

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): cerebras/SlimPajama-627B

Negative Logits

Token (quoted)          Logit
"uada"                  -0.08
"èŁ"                    -0.07
"YSTEM"                 -0.07
".OrderByDescending"    -0.07
"onse"                  -0.07
"nop"                   -0.07
"è³ŀ"                   -0.07
"deaux"                 -0.07
"hurst"                 -0.06
"enou"                  -0.06

Positive Logits

Token (quoted)    Logit
" harm"           0.10
" Problem"        0.09
"hel"             0.09
" responsible"    0.08
" problem"        0.08
" Responsible"    0.08
" compuls"        0.08
" harms"          0.08
"hel"             0.07
" Harm"           0.07

Activations Density 0.004%
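
To give a rough sense of scale for the density figure, the sketch below combines it with the prompt configuration listed earlier (24,576 prompts of 128 tokens each). It assumes, and the page does not confirm this, that the density is the share of token positions with a nonzero activation and that it was measured over that same prompt set. The function names are hypothetical and used only for illustration.

```python
# Figures copied from the dashboard text.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PERCENT = 0.004  # "Activations Density 0.004%"


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total token positions in the dashboard prompt set."""
    return num_prompts * tokens_per_prompt


def expected_active_tokens(total: int, density_percent: float) -> float:
    """Approximate count of positions where the feature is active.

    ASSUMPTION: density is the percentage of token positions with a
    nonzero activation, measured over this same prompt set.
    """
    return total * density_percent / 100.0


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = expected_active_tokens(total, ACTIVATION_DENSITY_PERCENT)

print(f"token positions: {total:,}")
print(f"approx. active positions: {round(active)}")
```

Under those assumptions the feature would be active on only about a hundred or so of roughly three million token positions, which fits a narrow topic such as problem-gambling support language. The displayed density is rounded, so the estimate is approximate.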