INDEX

Explanations

strong ethical and moral considerations, particularly around sensitive and harmful topics.

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

。

0.21

 veya

0.21

 หรือ

0.20

 أو

0.19

或

0.19

；

0.18

或其他

0.18

หรือ

0.17

POSITIVE LOGITS

并通过

0.16

そして

0.16

时间和

0.16

 albeit

0.16

and

0.16

 encouraged

0.15

and

0.15

supported

0.14

 включая

0.14

정과

0.14

Activations Density 1.520%