INDEX

Explanations

words following titles or specific nouns

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ב

1.03

 Validación

0.95

 Virology

0.93

ور

0.92

Мі

0.92

槛

0.91

 Seguridad

0.91

 trasera

0.91

ୃ

0.90

 Apostolic

0.90

POSITIVE LOGITS

one

0.88

たとえば

0.88

 club

0.75

 дня

0.74

ого

0.73

0.72

д

0.70

to

0.70

doc

0.70

Activations Density 0.002%