Explanations

avoids negative or excessive outputs

Configuration

Prompts (Dashboard): 238,145 prompts, 512 tokens each

Dataset (Dashboard): lmsys + oasst1

Negative Logits

 unmistakable    0.19
 with            0.18
 corresponds     0.17
 pourront        0.16
 example         0.16
 contiguous      0.16
(~               0.16
 combination     0.16
 ngunit          0.16
 luminance       0.16

Positive Logits

 blindly         0.24
 needlessly      0.22
 complicate      0.21
过度              0.21
 unduly          0.21
濫               0.21
 महंगे           0.20
 jeopard         0.20
 immoral         0.20
 exces           0.20

Activations Density 4.829%