INDEX

Explanations

personal pronouns and subsequent words

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

どのような

0.95

 해당

0.76

 bidirectional

0.76

人员

0.75

áles

0.75

の種類

0.75

当該

0.73

 दर्शाता

0.71

琀

0.70

үндө

0.70

POSITIVE LOGITS

 clumsy

0.97

 myself

0.95

 smiled

0.95

 grin

0.94

 همیشه

0.94

 weary

0.94

 curls

0.92

 Knows

0.89

 suave

0.89

 didn

0.89

Activations Density 0.110%