INDEX

Explanations

contrasting statements or caveats

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 mutta

0.47

 pero

0.46

িয়৷

0.46

但我

0.46

 ஆனால்

0.45

但是我

0.45

 Nhưng

0.45

ஆனால்

0.44

但

0.44

 actitud

0.43

POSITIVE LOGITS

Equal

0.43

 außergewöhn

0.40

までの

0.38

Throws

0.37

Conceptual

0.37

Dangerous

0.37

 extraordinary

0.36

Chim

0.36

 novelty

0.36

 يصبح

0.36

Activations Density 0.117%