INDEX

Explanations

true statements leading to contradictions

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 потенциа

0.47

φέ

0.46

이

0.45

element

0.44

ája

0.44

𝖗

0.44

का

0.44

 Kald

0.43

готовка

0.42

오

0.42

POSITIVE LOGITS

inyin

0.48

 spiritual

0.47

 Spiritual

0.44

pgamma

0.44

 sphing

0.44

 preached

0.43

 bilirubin

0.43

 prophes

0.43

 موسیقی

0.43

datatables

0.42

Activations Density 0.006%