INDEX

Explanations

greetings and offers of help

tokens produced by the assistant (model), i.e., parts of assistant responses or model-generated turns

Configuration

Prompts (Dashboard): 238,145 prompts, 512 tokens each

Dataset (Dashboard): lmsys + oasst1

Negative Logits

Token (quoted to preserve leading spaces), then value as displayed:

"linkCell"  0.54
" ವಿರುದ್ಧ"  0.52
" repeated"  0.49
"👎"  0.49
" removing"  0.48
" weaker"  0.48
" cytotoxicity"  0.48
" rejecting"  0.47
" dampak"  0.46
"导致"  0.46

Positive Logits

Token (quoted to preserve leading spaces), then value as displayed:

" 본격"  0.71
" готовы"  0.68
"Welcome"  0.62
" bienvenue"  0.61
"これから"  0.60
" bienvenidos"  0.60
" Ready"  0.59
" готова"  0.59
" 이곳"  0.59
" Welcome"  0.58

Activations Density 0.837%
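
To put the density figure in context, here is a minimal sketch of the arithmetic. It assumes that "Activations Density" means the fraction of dashboard tokens on which the feature has a nonzero activation, and that every one of the 238,145 prompts contributes exactly 512 tokens. Neither assumption is confirmed by the page itself, and the function name is hypothetical.

```python
# Figures copied from the dashboard configuration and density line.
NUM_PROMPTS = 238_145
TOKENS_PER_PROMPT = 512
DENSITY_PERCENT = 0.837


def estimate_active_tokens(num_prompts, tokens_per_prompt, density_percent):
    """Return (total tokens, estimated tokens with nonzero activation).

    Assumption: density is the share of all dashboard tokens on which
    the feature fires, and every prompt is exactly tokens_per_prompt long.
    """
    total = num_prompts * tokens_per_prompt
    active = round(total * density_percent / 100)
    return total, active


total_tokens, active_tokens = estimate_active_tokens(
    NUM_PROMPTS, TOKENS_PER_PROMPT, DENSITY_PERCENT
)
print(f"total tokens:  {total_tokens:,}")
print(f"active tokens: {active_tokens:,} (estimate)")
```

Under these assumptions the dashboard covers roughly 122 million tokens, of which about one million activate the feature. That is consistent with a sparse feature that fires mainly on greeting and welcome tokens.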