INDEX

Explanations

introducing claims and theories

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Player

0.39

 потребуется

0.38

处理

0.36

 处理

0.36

 প্রতিটি

0.35

CPU

0.34

如果我们

0.34

處理

0.34

組み合わせ

0.34

আমরা

0.33

POSITIVE LOGITS

認為

0.69

 critics

0.67

认为

0.65

 detractors

0.57

 считают

0.57

 proponents

0.55

 mengatakan

0.55

Critics

0.55

认为是

0.53

 zeggen

0.52

Activations Density 0.185%