Explanations

cleaning actions with adverbs

Configuration

Prompts (Dashboard): 238,145 prompts, 512 tokens each

Dataset (Dashboard): lmsys + oasst1

Negative Logits

ASK    0.39
 বান্ধবী    0.38
 apprent    0.38
 coursework    0.35
 заве    0.34
ዥ    0.34
ուս    0.33
(";    0.32

Positive Logits

 thoroughly    0.93
 Thorough    0.71
 vigorously    0.66
 тщательно    0.64
彻底    0.63
 gently    0.61
 diligently    0.60
 cuidadosamente    0.59
 carefully    0.55
 lightly    0.54

Activations Density 0.024%
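
To give the density figure some scale, the short sketch below converts it into an approximate token count. It assumes, and the page does not confirm this, that the density is measured over the dashboard prompt set listed under Configuration (238,145 prompts of 512 tokens each). The variable names are illustrative only.

```python
# Rough scale of the activation density.
# Assumption (not stated on the page): the density is computed over the
# dashboard prompt set, i.e. 238,145 prompts of 512 tokens each.
NUM_PROMPTS = 238_145
TOKENS_PER_PROMPT = 512
ACTIVATION_DENSITY_PERCENT = 0.024

total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT
expected_active_tokens = total_tokens * ACTIVATION_DENSITY_PERCENT / 100

print(f"total tokens: {total_tokens:,}")
print(f"approx. active tokens: {expected_active_tokens:,.0f}")
```

Under that assumption the feature would fire on roughly 29 thousand of about 122 million tokens, which fits a narrow feature tied to manner adverbs such as "thoroughly" and "gently".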