INDEX

Explanations

sexually suggestive content

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Moreover

0.42

 hemodynamic

0.42

adapter

0.41

ગવાન

0.40

 Indeed

0.39

 apoptosis

0.38

臘

0.38

 afternoons

0.38

 மட்டுமல்ல

0.38

 denaturation

0.38

POSITIVE LOGITS

请

0.40

Security

0.40

Please

0.39

Registered

0.38

 registrada

0.38

 lütfen

0.38

Bitte

0.37

 sintet

0.37

syntax

0.37

ทะเบียน

0.37

Activations Density 0.003%