INDEX

Explanations

references to data theft and personal information publication laws

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

PlotsExplanationShow Test FieldDefault Test Text

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

erif

-0.06

esch

-0.06

ittel

-0.06

.FontStyle

-0.05

 ÐĦÐ²

-0.05

roller

-0.05

_abstract

-0.05

Ð»ÑĥÐ³

-0.05

connector

-0.05

 ÑĤÑĢÐ°Ð½ÑģÐ¿Ð¾ÑĢÑĤ

-0.05

POSITIVE LOGITS

 revenge

0.14

 Revenge

0.14

 sext

0.13

 nude

0.11

 Nude

0.10

 intimate

0.10

 photos

0.10

 images

0.10

 explicit

0.10

 distribution

0.09

Activations Density 0.013%