INDEX

Explanations

explaining concepts or definitions

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 creando

0.46

 skapa

0.46

 pikiran

0.45

 thyme

0.42

 마음

0.42

 perturbation

0.41

 perverse

0.41

 ಮನ

0.40

ہنی

0.40

([])

0.39

POSITIVE LOGITS

ちなみに

0.57

 Moreover

0.55

也就是说

0.54

 Interestingly

0.51

 Specifically

0.50

mittedly

0.50

 உண்மையில்

0.50

 Notably

0.49

 actually

0.49

 Essentially

0.49

Activations Density 0.613%