INDEX

Explanations

barely acknowledged or not

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ük

0.42

利用

0.42

 الفر

0.40

 majoring

0.39

 sidelined

0.39

 ब्रैकेट

0.39

 Unfortunately

0.37

 неболь

0.37

<sub>

0.36

dependent

0.36

POSITIVE LOGITS

 siquiera

0.74

 coher

0.67

 coherent

0.63

 explicação

0.55

 consciously

0.54

表情

0.50

 fysis

0.49

 Worte

0.49

 comprehend

0.48

 woorden

0.48

Activations Density 0.041%