INDEX

Explanations

majority, all, important

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

`:

0.50

 Routing

0.48

il

0.44

 Diffraction

0.44

0.43

0.42

 Signals

0.42

：

0.42

 Estimation

0.40

 Railroad

0.40

POSITIVE LOGITS

 большинстве

0.49

 அனைத்து

0.46

 важное

0.45

 എല്ലാ

0.44

 বেশির

0.43

 heterosexual

0.43

 অধিকাংশই

0.43

 ideology

0.43

 тільки

0.43

 большинство

0.43

Activations Density 0.078%