INDEX

Explanations

terms related to threats or violence

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): cerebras/SlimPajama-627B

Negative Logits

Token                      Logit
" Whilst"                  -0.08
" whilst"                  -0.07
".detach"                  -0.07
[token missing in source]  -0.06
"Whilst"                   -0.06
" Bren"                    -0.06
" Lastly"                  -0.06
" amongst"                 -0.05
"å«"                       -0.05
" PartialView"             -0.05

Positive Logits

Token     Logit
"quet"    0.08
"jez"     0.08
"leta"    0.07
"juana"   0.07
"ruh"     0.07
"fuck"    0.07
"ende"    0.07
"amac"    0.07
"orre"    0.06
"leton"   0.06

Activations Density: 0.000%