INDEX

Explanations

"Could you" followed by a request

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 estamos

0.38

 estábamos

0.36

 இனி

0.36

 estaremos

0.35

 Estamos

0.34

irish

0.34

 klingt

0.34

 currentPage

0.34

怸

0.34

多かった

0.33

POSITIVE LOGITS

 please

0.81

给我

0.80

 explain

0.77

pls

0.74

 give

0.72

 help

0.70

給我

0.70

 provide

0.70

帮忙

0.69

 пожалуйста

0.68

Activations Density 0.017%