Explanations

to achieve or work towards

Configuration

Prompts (Dashboard): 10,000 prompts, 128 tokens each

Dataset (Dashboard): lmsys/lmsys-chat-1m

Negative Logits

icha

-0.10

-0.09

atz

-0.09

 replic

-0.09

eba

-0.09

 adequ

-0.09

 tuned

-0.09

 gains

-0.08

Swe

-0.08

Positive Logits

达

0.19

達

0.19

实现

0.17

 дости

0.16

 đạt

0.16

达到

0.15

 alcan

0.15

 towards

0.15

 достиг

0.14

SMART

0.14
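
Non-ASCII tokens in a logit list like this one are often exported in the byte-level BPE surface form used by GPT-2-style tokenizers, where each UTF-8 byte is shown as a stand-in character (for example, `è¾¾` is the three-byte encoding of 达). The sketch below inverts that mapping. It assumes the model's tokenizer follows the GPT-2 byte-to-unicode table, which this page does not state, and `decode_token` is a hypothetical helper name rather than part of any dashboard API.

```python
def bytes_to_unicode() -> dict[int, str]:
    """GPT-2-style table mapping each byte value to a printable stand-in character."""
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    for b in range(256):
        if b not in printable:
            # Non-printable bytes are shifted to code points 256 and above.
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: stand-in character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(surface: str) -> str:
    """Hypothetical helper: turn a byte-level surface form back into readable text."""
    raw = bytearray()
    for ch in surface:
        if ch in UNICODE_TO_BYTE:
            raw.append(UNICODE_TO_BYTE[ch])
        else:
            # Assumption: characters outside the table (such as a plain leading
            # space shown in place of the usual marker) are already literal text.
            raw.extend(ch.encode("utf-8"))
    return raw.decode("utf-8", errors="replace")


for surface in ["è¾¾", "éģĶ", "å®ŀçİ°", " Äĳáº¡t", " towards"]:
    print(repr(surface), "->", repr(decode_token(surface)))
```

Read this way, the top positive tokens are words or word fragments meaning "achieve" or "reach" in Chinese, Russian, Vietnamese, and Spanish, which matches the explanation given for the feature.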

Activations Density 0.102%
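
As a rough worked example, the density figure can be converted into an approximate token count. This assumes the density is measured over the dashboard's 10,000 prompts of 128 tokens each, which the page does not confirm; `approx_active_tokens` is a hypothetical helper name.

```python
def approx_active_tokens(n_prompts: int, tokens_per_prompt: int, density_percent: float) -> float:
    """Estimate how many tokens activate the feature, given a density in percent."""
    total_tokens = n_prompts * tokens_per_prompt
    return total_tokens * density_percent / 100.0


# Values taken from the dashboard configuration and density line above.
estimate = approx_active_tokens(10_000, 128, 0.102)
print(f"about {estimate:,.0f} of {10_000 * 128:,} tokens")
```

Under that assumption the feature fires on roughly 1,300 of 1,280,000 tokens, so it is sparse, as expected for a feature tied to a narrow meaning.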