INDEX

Explanations

words indicating possibility and difficulty

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 можна

0.84

假如

0.79

 тех

0.74

 ermöglichen

0.74

 можно

0.74

బ్యా

0.71

 bekannt

0.71

 квадра

0.71

$.

0.69

我要

0.69

POSITIVE LOGITS

 struggled

1.98

 hesitated

1.90

 unsure

1.87

 hesitant

1.77

 struggles

1.76

 confused

1.74

 struggle

1.72

 struggling

1.71

 perplexed

1.68

 puzzled

1.67

Activations Density 0.260%