INDEX

Explanations

evaluating good and bad

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

0.80

0.61

</span>

0.54

 处理

0.53

 Arco

0.51

ের

0.51

ensitivity

0.50

/>;

0.50

Handles

0.50

Response

0.49

POSITIVE LOGITS

good

0.76

 GOOD

0.71

GOOD

0.71

 good

0.70

ά

0.70

ור

0.67

좋

0.63

Good

0.63

and

0.61

善

0.61

Activations Density 0.069%