INDEX

Explanations

description or specific details

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 moral

0.42

 мора

0.41

كره

0.40

 élevés

0.39

여

0.39

 morals

0.39

moral

0.38

明

0.38

 altos

0.37

foam

0.37

POSITIVE LOGITS

 Voici

0.47

Details

0.47

 detailed

0.47

 подробно

0.46

の詳細

0.44

 specified

0.43

details

0.42

இதில்

0.41

 Details

0.41

 подроб

0.40

Activations Density 0.228%