INDEX

Explanations

mathematical calculations across languages

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

her

0.51

…

0.50

fl

0.49

im

0.49

av

0.48

 personal

0.48

 criminal

0.48

mit

0.47

au

0.47

men

0.47

POSITIVE LOGITS

 값을

0.77

umlahan

0.76

値を

0.76

Substituting

0.76

 numberWith

0.75

Arithmetic

0.74

 हमे

0.74

 করিয়৷

0.73

題目

0.73

modulo

0.73

Activations Density 0.301%