INDEX

Explanations

specific leading words and codes

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

selfobj

0.32

$)$.

0.29

रावर

0.27

Hfn

0.27

ampaikan

0.27

logrus

0.27

sadquotes

0.27

acchati

0.26

.³

0.26

 Ciebie

0.26

POSITIVE LOGITS

 other

0.41

 also

0.39

 کے

0.31

to

0.31

 другие

0.31

 lainnya

0.30

 otros

0.29

 andere

0.29

 inoltre

0.29

 के

0.28

Activations Density 0.297%