INDEX

Explanations

Acceptance Criteria

Configuration

Prompts (Dashboard): 238,145 prompts, 512 tokens each

Dataset (Dashboard): lmsys + oasst1

Negative Logits

Token             Logit
" narration"      0.39
" narrator"       0.37
" nick"           0.37
"塡"              0.37
" valor"          0.36
" integrations"   0.36
" المبلغ"         0.36
"褒"              0.36
"fam"             0.36
" squirrel"       0.35

Positive Logits

Token             Logit
" Acceptance"     1.16
"Accept"          1.04
" acceptance"     1.02
"acceptance"      1.00
" aceptación"     0.93
" Accept"         0.93
"ACCEPT"          0.93
"accept"          0.84
" accept"         0.82
" ACCEPT"         0.82

Activations Density: 0.002%