INDEX

Explanations

highlighting, truthful, short, improve, converting

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Shields

0.38

 Shield

0.38

Sheet

0.38

 shield

0.36

Shield

0.35

tywn

0.35

 shields

0.35

 weakened

0.35

ologis

0.34

圣

0.34

POSITIVE LOGITS

 کاب

0.43

inoza

0.41

 cabinets

0.40

ransition

0.39

 बुल

0.39

Ste

0.39

厸

0.39

lemagne

0.38

 escalator

0.38

 hemicontinuous

0.38

Activations Density 0.000%