Explanations

references to incidents of sexual violence and their victims

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

Token          Logit
"éŁ¿"          -0.08
"asl"          -0.07
" sÃ¢n"        -0.07
"morgan"       -0.07
"contres"      -0.07
"addock"       -0.07
" murdering"   -0.07
".decorate"    -0.07
"ancode"       -0.07
"ali"          -0.07

Positive Logits

Token          Logit
"kid"          0.07
" left"        0.07
" attacked"    0.07
" subjected"   0.07
" beaten"      0.07
" subject"     0.07
" beat"        0.06
" forced"      0.06
" twice"       0.06
"atars"        0.06

Activations Density

0.022%
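
To put the configuration and density figures in context, here is a rough back-of-the-envelope sketch. It assumes that the activation density is the percentage of token positions on which the feature fires, and that it was measured over the same 24,576 prompts of 128 tokens listed under Configuration. Neither assumption is stated on the page, and the function names are illustrative only.

```python
# Figures quoted on the dashboard.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PERCENT = 0.022


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions in the prompt set."""
    return num_prompts * tokens_per_prompt


def estimated_active_tokens(total: int, density_percent: float) -> int:
    """Approximate count of token positions where the feature is active.

    Assumes density is a percentage of all token positions (an assumption,
    not something the dashboard states explicitly).
    """
    return round(total * density_percent / 100)


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = estimated_active_tokens(total, ACTIVATION_DENSITY_PERCENT)
print(f"total token positions: {total:,}")
print(f"estimated active positions: {active:,}")
```

Under those assumptions the feature would fire on only a few hundred of roughly three million token positions, which fits a narrowly scoped feature like the one described under Explanations.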