INDEX

Explanations

using specific tools and methods

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 acteurs

0.53

규모

0.53

 ပြော

0.52

 nevoia

0.52

 vendeurs

0.52

 ľudí

0.51

 нәрсә

0.50

 людей

0.50

 jurisdict

0.50

 👀

0.50

POSITIVE LOGITS

 software

0.72

 standard

0.64

using

0.63

software

0.62

 using

0.59

standard

0.59

 modified

0.59

modified

0.58

 method

0.55

TM

0.55

Activations Density 0.015%