INDEX

Explanations

consent and agreement

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 intensively

0.43

 aspire

0.38

தைப்

0.38

>∕

0.37

 সতর্ক

0.37

 ശ്രദ്ധ

0.37

Forbidden

0.36

㚼

0.36

 주로

0.35

⸩

0.35

POSITIVE LOGITS

 consent

0.80

consent

0.71

 agreeing

0.68

 confirming

0.67

Consent

0.67

 Consent

0.66

同意

0.66

 consents

0.66

 согласи

0.64

acceptance

0.63

Activations Density 0.040%