INDEX

Explanations

defining descriptive terms

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 باعث

0.46

 важли

0.38

 deleterious

0.37

 કારણે

0.37

睍

0.36

 Generals

0.35

 причиной

0.35

 leuke

0.34

ങ

0.34

 murderous

0.33

POSITIVE LOGITS

 using

0.59

 utilizzando

0.53

 according

0.52

 consisting

0.52

 encoded

0.51

:<

0.49

 identified

0.49

 utilizando

0.49

 USING

0.48

 consists

0.48

Activations Density 0.487%