INDEX

Explanations

phrases related to allegations of sexual misconduct or assault

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

abbo

-0.08

ãĥŃãĥ¼

-0.07

rane

-0.07

arest

-0.07

eft

-0.07

remen

-0.07

assin

-0.07

ErrorMsg

-0.06

Ð»Ð¾Ñĩ

-0.06

bsolute

-0.06

POSITIVE LOGITS

-alone

0.07

 alone

0.07

alone

0.07

 isolated

0.07

 naked

0.06

 modeling

0.06

Cons

0.06

 modelling

0.06

ãĥ¼ãĥĵ

0.06

 unlocked

0.06

Activations Density 0.006%