INDEX

Explanations

offering further information or predictions

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 cleanest

0.85

 want

0.75

Why

0.75

WHY

0.75

쩡

0.74

 understand

0.73

why

0.73

烃

0.72

 proton

0.72

 histology

0.71

POSITIVE LOGITS

orphic

0.69

</h4>

0.67

anha

0.64

पती

0.62

Predict

0.59

占

0.59

াহী

0.58

 predictive

0.58

fill

0.58

 ელ

0.57

Activations Density 0.079%