INDEX

Explanations

avoiding negative qualities

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

pł

0.73

 calma

0.72

淡

0.70

平静

0.67

穏

0.66

認

0.62

 cleanse

0.62

순

0.62

stav

0.62

片

0.61

POSITIVE LOGITS

 bulky

1.48

 uncomfortable

1.45

 cumbersome

1.41

 restrictive

1.38

 bulk

1.35

 restricting

1.22

 unsightly

1.21

Bulk

1.20

 awkward

1.20

bulk

1.19

Activations Density 0.180%