INDEX

Explanations

axioms and definitions

Configuration

Prompts (Dashboard): 238,145 prompts, 512 tokens each

Dataset (Dashboard): lmsys + oasst1

Negative Logits

" conteú"       0.54
"Toyota"        0.50
" которого"     0.46
"atthakath"     0.46
"વેશ"           0.45
"屬於"          0.45
" necessária"   0.44
"铷"            0.44
" configura"    0.44
"➔"             0.44

Positive Logits

" people"       0.48
" mountains"    0.47
"KL"            0.46
" मैं"           0.44
" comparing"    0.44
"我很"          0.43
" Mountain"     0.43
" mountain"     0.42
" lovers"       0.42

Activations Density 0.006%