INDEX

Explanations

ethical and responsible AI

The pattern involves words or phrases that describe qualities or actions related to being principled, morally sound, or aligned with established standards and norms. This includes references to responsibility, ethics, appropriateness, respect, following guidelines, maintaining safety, being constructive, vulnerability expressed properly, commitment to positive values, sophistication, comprehensive approaches, natural methods, and general descriptions of proper conduct or alignment with accepted practices.

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

0.42

("

0.40

0.36

("

0.34

 bulbs

0.34

 using

0.34

 Normalmente

0.33

0.32

比如

0.32

/"

0.31

POSITIVE LOGITS

 enthr

0.39

 unforgettable

0.38

 indulgence

0.36

 undeniably

0.35

 uncompromising

0.35

embrance

0.34

 undoubtedly

0.34

 decad

0.34

之际

0.33

 autumnal

0.33

Activations Density 2.965%