Explanations

mentions of violent stabbing or stabbing incidents

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

Token               Logit
"DebuggerNonUser"   -0.74
"webElementGuid"    -0.60
"AccessorTable"     -0.60
" opérateurs"       -0.60
" Públicas"         -0.59
" économies"        -0.59
"SourceChecksum"    -0.57
" unstable"         -0.57
"oa̍t"               -0.56
" незавершена"      -0.56

Positive Logits

Token               Logit
" stab"             3.50
"stab"              2.55
" Stab"             2.14
"Stab"              1.95
" stabbed"          1.77
" stabbing"         1.77
"捅"                0.60
" slashes"          0.59
"Hochspringen"      0.57
"jab"               0.57

Activations Density 0.002%
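
As a rough sanity check on the density figure, the sketch below estimates how many tokens the feature fires on. It assumes (the page does not say so) that the density is the fraction of tokens with a nonzero activation, measured over the same dashboard prompts listed under Configuration. The variable names are illustrative only.

```python
# Figures copied from the Configuration section of this page.
num_prompts = 24_576
tokens_per_prompt = 128

# "Activations Density" as displayed; the page rounds this value.
density_percent = 0.002

total_tokens = num_prompts * tokens_per_prompt

# Assumption: density = (tokens with nonzero activation) / (total tokens).
approx_active_tokens = total_tokens * density_percent / 100

print(f"total tokens: {total_tokens:,}")
print(f"approx. activating tokens: {approx_active_tokens:.0f}")
```

Under that assumption the feature would fire on roughly 63 of about 3.1 million tokens. Because the displayed percentage is rounded, the true count could differ noticeably from this estimate.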