INDEX

Explanations

ignoring simplicity and knowledge limitations

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 მათ

0.43

õ

0.43

科學

0.43

 pouring

0.41

 கண்டுபி

0.41

㿟

0.40

 beginning

0.40

 circulating

0.40

 початку

0.39

涜

0.39

POSITIVE LOGITS

 adopts

0.39

 defies

0.39

 financeiros

0.39

Finance

0.38

 shrug

0.38

punk

0.38

ignores

0.38

Modified

0.38

anne

0.38

льны

0.38

Activations Density 0.000%