INDEX

Explanations

just from, just testing

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

};

0.22

lewati

0.21

());

0.21

 erforder

0.21

ethyst

0.20

this

0.20

iterranean

0.20

 ambayo

0.20

cribed

0.19

့

0.19

POSITIVE LOGITS

 purely

0.37

 merely

0.33

แค่

0.30

 simply

0.29

 mere

0.28

 단순히

0.28

单纯

0.28

 aesthetics

0.26

 simplemente

0.26

 simplement

0.26

Activations Density 0.979%