INDEX

Explanations

avoiding negative outcomes

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

倝

0.38

 enumer

0.36

 concre

0.35

酽

0.35

 kucing

0.35

Nathan

0.34

 মাঝি

0.33

^{\#}\

0.33

ल्लभ

0.32

ഞ്ചി

0.32

POSITIVE LOGITS

 Eventually

0.35

めます

0.35

かります

0.35

 hiszen

0.34

 Está

0.34

Если

0.33

اردوش

0.33

ąg

0.33

ln

0.32

ようになりました

0.32

Activations Density 0.080%