INDEX

Explanations

select the correct option

New Auto-Interp

Top Features by Cosine Similarity

Configuration

Prompts (Dashboard)

10,000 prompts, 128 tokens each

Dataset (Dashboard)

lmsys/lmsys-chat-1m

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 requis

-0.08

 synd

-0.08

ecome

-0.08

lements

-0.08

adm

-0.08

idas

-0.08

Impossible

-0.08

ÂĢÂĢ

-0.08

æĺĵ

-0.08

 unint

-0.08

POSITIVE LOGITS

 correct

0.21

 option

0.19

 appropriate

0.18

ropriate

0.14

correct

0.13

 Option

0.13

æŃ£ç¡®

0.13

appropriate

0.13

 proper

0.13

 best

0.12

Activations Density 0.031%