Explanations

help you or ensure

Configuration

Prompts (Dashboard)

10,000 prompts, 128 tokens each

Dataset (Dashboard)

lmsys/lmsys-chat-1m

Negative Logits

" liền"       -0.09
"aint"        -0.09
"uns"         -0.09
"halt"        -0.09
" simmer"     -0.08
" Blanc"      -0.08
"oram"        -0.08
"rel"         -0.08
"uum"         -0.08
" Wass"       -0.08

Positive Logits

" help"        0.19
" expertise"   0.17
" assistance"  0.16
"帮"           0.13
"help"         0.13
" giúp"        0.13
" assist"      0.13
"帮助"         0.12
" helps"       0.12
" advice"      0.12

Activations Density 0.060%
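
To put the density figure in context, the sketch below estimates how many token positions in the dashboard sample it corresponds to. It assumes that "Activations Density" means the fraction of sampled tokens on which the feature has a non-zero activation, and that it was measured over the 10,000 prompts of 128 tokens listed under Configuration. Neither assumption is confirmed by the page itself, and the function name is purely illustrative.

```python
def expected_active_tokens(num_prompts: int, tokens_per_prompt: int,
                           density_percent: float) -> int:
    """Estimate how many token positions activate the feature.

    Assumption (not stated on the page): density is the share of all
    sampled tokens with a non-zero activation, given as a percentage.
    """
    total_tokens = num_prompts * tokens_per_prompt
    return round(total_tokens * density_percent / 100)


if __name__ == "__main__":
    # Values taken from the dashboard: 10,000 prompts, 128 tokens each, 0.060%.
    total = 10_000 * 128
    active = expected_active_tokens(10_000, 128, 0.060)
    print(f"{active} of {total} tokens")  # 768 of 1280000 tokens
```

Under these assumptions the feature fires on roughly 768 of 1,280,000 tokens, which is consistent with a sparse feature tied to a narrow concept such as offers of help.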