INDEX

Explanations

preventing incidents and crises

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 hegemony

0.48

 synthesis

0.46

 supremacy

0.46

 deuter

0.42

 harmony

0.42

her

0.41

 purified

0.41

 sera

0.40

Æ

0.40

rosis

0.40

POSITIVE LOGITS

बाइक

0.43

 হতাহ

0.42

 সাধারণ

0.41

 பொதுமக்கள்

0.39

近年来

0.39

 ఇటీవల

0.39

ergewöhn

0.38

 inexpensive

0.37

ανα

0.36

eBay

0.36

Activations Density 0.002%