Explanations

refusing sexually explicit content

Configuration

Prompts (Dashboard): 392,802 prompts, 256 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Token (quoted to show leading spaces)    Value
" மதிப்பு"      0.95
" Paston"       0.90
"ܢ"             0.90
" Major"        0.87
" Winston"      0.85
" Clarkson"     0.84
" Kitts"        0.84
"__/"           0.84
"ዘ"             0.80
" Cavaliers"    0.80

Positive Logits

Token (quoted to show leading spaces)    Value
"ற்ச"           1.09
"лась"          0.97
"лось"          0.91
"aucet"         0.87
" Kail"         0.87
"쉽"            0.86
" निधि"         0.83
" normalize"    0.82
"лся"           0.82
"tool"          0.82

Activations Density 0.134%
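
To give the density figure a sense of scale, here is a minimal Python sketch that converts it into an approximate count of token positions. It rests on two assumptions the page does not state: that activation density means the fraction of token positions where the feature has a nonzero activation, and that it was measured over the same prompt set listed under Configuration. The variable names are illustrative only.

```python
# Figures copied from the dashboard text.
NUM_PROMPTS = 392_802
TOKENS_PER_PROMPT = 256
ACTIVATION_DENSITY_PERCENT = 0.134

# Assumption: density = (token positions with nonzero activation) / (all token positions),
# measured over the prompt set above. The dashboard does not confirm this definition.
total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT
estimated_active_tokens = round(total_tokens * ACTIVATION_DENSITY_PERCENT / 100)

print(f"total token positions: {total_tokens:,}")
print(f"estimated active positions: {estimated_active_tokens:,}")
```

Under those assumptions the feature would fire on roughly 135 thousand of about 100 million token positions, which is consistent with a sparse feature.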